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1 . THIS NEW APPLICATION IS A NATIONAL STAGE 
APPLICATION UNDER PCT. CHAPTER II WITH A REQUEST FOR 
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DESIGNATED/ELECTED OFFICE (DO/EO/US). 
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371; 
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under 35 U.S.C. § 371. 
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limit 35 U.S.C. 371(b) and PCT Articles 22 and 39(1). 
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3 A proper Demand for International Preliminary Examination was made by the 1 9™ 
month from the earliest claimed priority date. 

4. Enclosed items are required for filing under 37 CFR § 1 .53(b) and § 1 .494(b) or § 
1.495(b): 

One copy of International Publication WO 01/23417 

g| is transmitted herewith (required only if not transmitted by the 
International Bureau) 

has been transmitted by the International Bureau 

is not required, as the application was filed in the United States 
Receiving Office (RO/US) 

Fees 



(a) 

(b) 
(c) 



El 

□ 



The basic national fee set forth in 37 CFR § 1 .482 - International Preliminary 
Examination Fee not paid to USPTO but International Search Report prepared by the EPO or 
JPO - $860.00 
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Claims in Excess of 20 (12 @ $18.00) 
Independent Claims in Excess of 3 (1 @ $80.00) 



Further enclosed are: 



IE! 



One copy of International Preliminary Examination Report. 
One copy of International Search Report - (with references) 



6. Information Disclosure Statement: 

• The Examiner is requested to consider carefully the complete text of the documents 
submitted herewith in connection with the examination of this application. It is believed that 
the Examiner will concur with Applicant's belief that the documents do not adversely affect 
the patentability of the subject matter presently claimed, taken alone or in combination. 

• It is requested that the listed documents be included in the "References Cited" portion 
of any patent issuing from this application. 

• Under 37 CFR 1 .97(i), Applicants understand that non-complying Information 
Disclosure Statements will be placed in the file but not considered by the Office, however, 
under Reply to Comment 8 of the Federal Register, page 2024, Applicants will be informed 
when information is not considered. 

[X] Applicants make of record the enclosed documents, which include those documents 
identified in the International Search Report or in the International Preliminary 
Examination Report. These documents are listed on the enclosed Forms PTO/SB/08A 
and PTO/SB/08B. 

7. | | A translation of the International Application into English (35 U.S.C. § 
371(c)(2)) 

8. □ Amendments to the claims of the International Application under PCT Article 
19 (35 U.S.C. § 371(c)(3)) 
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9. □ A translation of the amendments to the claims under PCT Article 1 9 (35 
U.S.C. § 371(c)(3)) 

Still additional papers enclosed: 

[X] Assignment with Assignment Recordation Form Cover Sheet 
| | Verification Statement Claiming Small Entity Status 
[X] Declaration or oath is enclosed executed by the inventor 
^| Return Acknowledgment Postcard 



11. Nucleotide and/or Amino Acid Sequence Submission (if applicable, all necessary) 
A. Enclosed are: 



B 
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(a) 
(b) 



□ 
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Computer Readable Copy of the Sequence Listing 
Paper Copy (identical to Computer Readable Copy) of the 
Sequence Listing 

Enclosed is a paper copy of the Sequence Listing. This paper copy and 
a Computer Readable Form thereof are identical with the Computer 
Readable Form in another application of the Applicant which is fully 
identified as follows: 

U.S. Application No, 

Filed: 

Attorney Docket No.: 
which is believed to comply with the rules set forth in 37 CFR § 1.821 
et. seq. Applicants requests pursuant to 37 CFR § 1 .821(e) that this 
Computer Readable Form be used in the present application. Please 
TRANSFER the sequence listing from the parent to this application. 
Statement under 37 CFR § 1.821(f): The information recorded in 
computer readable form is identical to the written Sequence Listing. 
Statement under 37 CFR § 1 .821(g) (required when Sequence Listing 
not submitted at the time of filing under 35 U.S.C. §1 1 1(a)) or 37 CFR 
§1.82 1(f) (required when Sequence Listing not submitted at the time of 
filing under the Patent Cooperation Treaty): The submission of the 
Sequence Listing includes no new matter. 

Amendment: Please enter the Sequence Listing into the application, 
replacing the prior Sequence Listing. 
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Preliminary Amendment 

Prior to calculation of fees, kindly enter: 

[X] Preliminary Amendment submitted herewith 

I I do not enter Preliminary Amendment 

Fee payment being made at this time is enclosed: 

* Basic filing fee ($860.00) 

Claims in Excess of 20 
(12 @ $18.00) 

Independent Claims in Excess of 3 
(1 @$80 00) 

Filing Assignment (1 @ $40.00) 
Total Fees enclosed: 
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860.00 
216.00 

80 00 

40.00 
$1196.00 



14. The Commissioner is hereby authorized to charge any fees which may be required, or 
credit any overpayment to Account No. 50-0258. This letter is filed in duplicate for 
accounting purposes. 



Date: April 1, 2002 
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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



Applicant: Coche et al. 

International Application No. PCT/EPOO/9500 

Docket No.: BC45263 Group Art Unit No.: Unknown 

U.S. Serial No.: Unknown Examiner: Unknown 

Filed: Herewith 

For: Human Tumor-Associated LAK-4P Related Polynucleotides and Polypeptides 
and Their Uses. 



PRELIMINARY AMENDMENT 

Sir: 

Applicant respectfully requests that this Preliminary Amendment be entered in this case 
before the calculation of fees and before examination of the subject application. 

In the Claims: 

Please delete the claims of the application as filed in the PCT and substitute therefor: 

38. An isolated polypeptide comprising a member selected from the group consisting of 

(a) an amino acid sequence which has at least 90% identity to SEQ ID NO:2 or 4; 

(b) an immunogenic fragment of the amino acid sequence of (a), wherein the 
immunogenic fragment is at least 90% identical to an aligned contiguous 
segment of SEQ ID NO:2 or 4; and 

(c) an immunogenic fragment of the amino acid sequence of (a) that matches an 
aligned contiguous segment of SEQ ID NO:2 or 4 with no more than five single 
amino acid substitutions, deletions or additions, 

wherein the isolated polypeptide, when administered to a subject in a suitable composition 
which can include an adjuvant or a carrier coupled to the polypeptide, induces an immune 
response that recognizes a polypeptide having the sequence of SEQ ID NO:2 or 4. 

39. The isolated polypeptide of Claim 38 wherein the amino acid sequence has at least 95% 
identity to SEQ ID NO:2 or 4 or the aligned contiguous segment of SEQ ID NO:2 or 4. 

40. The isolated polypeptide of Claim 39 wherein the polypeptide has the amino acid 
sequence of SEQ ID NO:2 or 4. 
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41. A fusion protein comprising the isolated polypeptide of Claim 38. 

42. The isolated polypeptide of Claim 38 wherein the polypeptide is the immunogenic 
fragment having no more than two single amino acid substitutions, deletions or additions 
relative to the aligned sequence. 

43. The isolated polypeptide of Claim 38 wherein the polypeptide is the immunogenic 
fragment having no more than one single amino acid substitution, deletion or addition relative 
to the aligned sequence. 

44. The isolated polypeptide of Claim 38 wherein the polypeptide is the immunogenic 
fragment which matches the aligned sequence. 

45. An isolated polypeptide encoded by an isolated first polynucleotide wherein the 
isolated first polynucleotide hybridizes under stringent conditions to a second polynucleotide 
which encodes the polypeptide of SEQ ID NO:2 or 4; wherein stringent conditions comprise 
overnight incubation at 42° C. in a solution comprising: 50% formamide, 5><SSC (150 mM 
NaCl, 15 mM trisodium citrate), 50 mM sodium phosphate (pH 7.6), 5x Denhardf s solution, 

1 0% dextran sulfate, and 20 micrograms/ml denatured, sheared salmon sperm DNA, followed 
by washing the filters in 0.1 x SSC at about 65° C; wherein the isolated polypeptide, when 
administered to a subject, induces an immune response that recognizes a polypeptide having 
the sequence of SEQ ID NO:2 or 4. 

46. An isolated polynucleotide encoding the polypeptide of Claim 38. 

47. An expression vector comprising the isolated polynucleotide of Claim 46. 

48. A host cell transformed with the expression vector of Claim 47. 

49. A process of producing an isolated polypeptide comprising (a) culturing the host cell of 
Claim 48 under conditions sufficient for the production of the encoded polypeptide and (b) 
recovering the polypeptide. 
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50. An immunogenic composition comprising the isolated polynucleotide of Claim 46 or 
an expression vector comprising the isolated polynucleotide, effective in a vaccinated mammal 
to express the polypeptide. 

5 1 . An immunogenic composition comprising the isolated polynucleotide of Claim 46 or 
an expression vector comprising the isolated polynucleotide comprised within a microorganism 
effective itself or through its host to express the polypeptide. 

52. An isolated polynucleotide segment comprising a polynucleotide sequence or the full 
complement of the entire length of the polynucleotide sequence, wherein the polynucleotide 
sequence is identical to SEQ ID NO:l or 3 minus any terminal stop codon, except that, over 
the entire length corresponding to SEQ ID NO:l or 3 minus any terminal stop codon, n n 
nucleotides are substituted, inserted or deleted, wherein n n satisfies the following expression 

n n < x n - (x n • y) 

wherein x„ is the total number of nucleotides in SEQ ID NO: 1 or 3 minus any terminal stop 
codon, y is at least 0.90, and wherein any non-integer product of x n and y is rounded down to 
the nearest integer before subtracting the product from x n ; and wherein the polynucleotide 
sequence detects a polynucleotide of SEQ ID NO: 1 or 3 minus any terminal stop codon. 

53. The isolated polynucleotide of Claim 52 where y is at least 0.95. 

54. An expression vector comprising the isolated polynucleotide of Claim 52 which codes 
for a polypeptide that, when administered to a mammal, induces an immune response that 
recognizes a polypeptide having the sequence of SEQ ID NO:2 or 4. 

55. A host cell transformed with the isolated polynucleotide or an expression vector 
comprising the isolated polynucleotide of Claim 52. 

56. A process of producing an isolated polypeptide comprising (a) culturing the host cell of 
Claim 55 under conditions sufficient for the production of the encoded polypeptide and (b) 
recovering the polypeptide. 

57. An immunogenic composition comprising the polypeptide of Claim 38. 
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58. The immunogenic composition of Claim 57 further comprising an adjuvant. 

59. The immunogenic composition of Claim 58 wherein the adjuvant induces a THl-type 
response. 

60. The immunogenic composition of Claim 58 the adjuvant is a member selected from the 
group consisting of 3D-MPL, QS21, a mixture of QS21 and cholesterol, and a CpG 
oligonucleotide. 

61 . An immunogenic composition comprising an effective amount of antigen presenting 
cells, (i) modified by in vitro loading with a polypepide of claim 38, or (ii) genetically 
modified to express a polypeptide of claim claim 38; and a pharmaceutically acceptable carrier. 

62. A method for inducing an immune response in a mammal comprising administration of 
the polypeptide of Claim 38. 

63. A method for screening to identify compounds which stimulate or which inhibit the 
function of the polypeptide of Claim 38 which comprises a method selected from the group 
consisting of: 

(a) measuring the binding of a candidate compound to the said polypeptide (or to the cells or 
membranes bearing the polypeptide) or a fusion protein thereof by means of a label directly or 
indirectly associated with the candidate compound; 

(b) measuring the binding of a candidate compound to the polypeptide (or to the cells or 
membranes bearing the polypeptide) or a fusion protein thereof in the presence of a labeled 
competitor; 

(c) testing whether the candidate compound results in a signal generated by activation or 
inhibition of the said polypeptide, using detection systems appropriate to the cells or cell 
membranes bearing the polypeptide; 

(d) mixing a candidate compound with a solution containing the polypeptide of Claim 38, to 
form a mixture, measuring activity of the polypeptide in the mixture, and comparing the 
activity of the mixture to a standard; or 
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(e) detecting the effect of a candidate compound on the production of mRNA encoding said 
polypeptide and said polypeptide in cells. 

64. A method for the treatment of a subject by immunoprophylaxis or therapy comprising 
in vitro induction of immune responses to a polypeptide of Claim 38, using in vitro incubation 
of the polypeptide with cells from the immune system of a mammal, and reinfusing these 
activated immune cells to the mammal for the treatment of disease. 

65 A method as claimed in Claim 64 wherein the treatment is for ovarian or colon cancer. 

66. A process for diagnosing a disease or a susceptibility to a disease in a subject related to 
expression or activity of the polypeptide of Claim 38 in a subject comprising: analyzing for 
the presence or amount of said polypeptide expression in a sample derived from said subject. 

67. The process of claim 66, wherein the disease is colon cancer. 

68. An isolated polypeptide comprising a member selected from the group consisting of 

(a) an amino acid sequence which has at least 90% identity to SEQ ID NO:6 or 8; 

(b) an immunogenic fragment of the amino acid sequence of (a) that matches an 
aligned contiguous segment of SEQ ID NO: 6 or 8 with no more than three singles 
amino acid substitutions, deletions or additions; and 

(c) an immunogenic fragment of the amino acid sequence of (a), wherein the 
immunogenic fragment is at least 90% identical to the aligned contiguous segment of 
SEQ ID NO:6 or 8, 

wherein the isolated polypeptide, when adminstered to a subject, induces an immune response 
that recognizes a polypeptide having the sequence of SEQ ID NO: 6 or 8. 

69. An isolated polynucleotide encoding the polypeptide of Claim 68. 
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REMARKS 
Claims 

Claims 1-37 have been canceled without prejudice or disclaimer of the subject matter 
therein. Applicant reserves the right to prosecute, in one or more patent applications, the 
canceled claims, the claims to non-elected inventions, the claims as originally filed, and any 
other claims supported by the specification. 

New claims 38-69 have been introduced. 

Su pport 

Support for the new claims is either apparent, or is as described in the text below. 
Particularly, support for the recitation of "five single amino acid substitutions, deletions or 
additions" may be found, for example, at page 7, lines 3 1-32. Support for the stringent 
hybridization conditions may be found, for example, at page 13, lines 6-10. Support for the 
recitation of sequence relatedness such as that found in that found in claim 52 may be found in 
the specification, for example, at pages 44, line 23 through page 45, line 10. No new matter is 
added. 

Closing Remarks 

The Notice of Allowance is earnestly solicited. 



DECHERT 

Princeton Pike Corporate Center 
P.O. Box 5218 

Princeton, New Jersey 08543-52 1 8 
Allen Bloom (609) 620-3214 
Arthur E. Jackson (609) 620-3254 
Fax: (609) 620-3259 
Attn: Allen Bloom, Esq. 



Respectfully submitted, 




Arthur E. Jacksj 
Registration No. 34,354 
Allen Bloom 
Registration No. 29,135 
Attorneys for Applicants 
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Novel Compounds 

The present invention relates to polynucleotides, herein referred to as CASB641 1 
polynucleotides, polypeptides encoded thereby (referred to herein as CASB641 1 
5 polypeptides), recombinant materials and methods for their production. In another aspect, 
the invention relates to methods for using such polypeptides and polynucleotides, including 
the treatment of cancer and autoimmune diseases and other related conditions. In a further 
aspect, the invention relates to methods for identifying agonists and antagonists/inhibitors 
using the materials provided by the invention, and treating conditions associated with 
10 CASB641 1 polypeptide imbalance with the identified compounds. In a still further 
aspect, the invention relates to diagnostic assays for detecting diseases associated with 
inappropriate CASB641 1 polypeptide activity or levels. 

Polypeptides and polynucleotides of the present invention are believed to be important 
15 immunogens for specific prophylactic or therapeutic immunization against tumours, because 
they are specifically expressed or highly over-expressed in tumours compared to normal 
cells and can thus be targeted by antigen-specific immune mechanisms leading to the 
destruction of the tumour cell. They can also be used to diagnose the occurrence of tumour 
cells. Furthermore, their inappropriate expression in certain circumstances can cause an 
20 induction of autoimmune, inappropriate immune responses, which could be corrected 
through appropriate vaccination using the same polypeptides or polynucleotides. In this 
respect the most important biological activities to our purpose are the antigenic and 
immunogenic activities of the polypeptide of the present invention. A polypeptide of the 
present invention may also exhibit at least one other biological activity of a CASB641 1 
25 polypeptide, which could qualify it as a target for therapeutic or prophylactic intervention 
different from that linked to the immune response. 

In a first aspect, the present invention relates to CASB641 1 polypeptides. Such peptides 
include isolated polypeptides comprising an amino acid sequence which has at least 70% 
30 identity, preferably at least 80% identity, more preferably at least 90% identity, yet more 
preferably at least 95% identity, most preferably at least 97-99% identity, to that of SEQ 
ID NO:2 OR 4 over the entire length of SEQ ID NO:2 OR 4. Such polypeptides include 
those comprising the amino acid of SEQ ID NO:2 OR 4. 

1 



WO 01/23417 



EgSflEPOO/09500 



Further peptides of the present invention include isolated polypeptides in which the 
amino acid sequence has at least 70% identity, preferably at least 80% identity, more 
preferably at least 90% identity, yet more preferably at least 95% identity, most 
preferably at least 97-99% identity, to the amino acid sequence of SEQ ID NO:2 OR 4 
over the entire length of SEQ ID NO:2 OR 4. Such polypeptides include the polypeptide 
ofSEQIDNO:2 0R4. 

Further peptides of the present invention include isolated polypeptides encoded by a 
polynucleotide comprising the sequence contained in SEQ ID NO:l. 

The invention also provides an immunogenic fragment of a CASB641 1 polypeptide, that is 
a contiguous portion of the CASB641 1 polypeptide which has the same or similar 
immunogenic properties to the polypeptide comprising the amino acid seqeunce of SEQ ID 
NO:2 OR 4. That is to say, the fragment (if necessary when coupled to a carrier) is capable 
of raising an immune response which recognises the CASB641 1 polypeptide. Such an 
immunogenic fragment may include, for example, the CASB641 1 polypeptide lacking an N- 
terminal leader sequence, a transmembrane domain or a C-terminal anchor domain. In a 
preferred aspect the immunogenic fragment of CASB641 1 according to the invention 
comprises substantially all of the extracellular domain of a polypeptide which has at least 
70% identity, preferably at least 80% identity, more preferably at least 90% identity, yet 
more preferably at least 95% identity, most preferably at least 97-99% identity, to that of 
SEQ ID NO:2 OR 4 over the entire length of SEQ ID NO:2 OR 4. Preferably an 
immunogenic fragment according to the invention comprises at least one epitope. 

Peptide fragments incorporating an epitope of CASB641 1 typically will comprise at least 
7, preferably 9 or 10 contiguous amino acids from SEQ ID NO:2 OR 4. Preferred 
epitopes are shown in SEQ ID NO:9 to SEQ ID NO:72. 

Peptides that incorporate these epitopes form a preferred aspect of the present invention. 
Mimotopes which have the same characteristics as these epitopes, and immunogens 
comprising such mimotopes which generate an immune response which cross-react with 
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an epitope in the context of the CASB641 1 molecule, also form part of the present 
invention. 

The present invention, therefore, includes isolated peptides encompassing these epitopes 
themselves, and any mimotope thereof. The meaning of mimotope is defined as an entity 
which is sufficiently similar to the native CASB641 1 epitope so as to be capable of being 
recognised by antibodies which recognise the native molecule; (Gheysen, H.M., et aL, 
1986, Synthetic peptides as antigens. Wiley, Chichester, Ciba foundation symposium 
119, pl30-149; Gheysen, H.M., 1986, Molecular Immunology, 23,7, 709-715); or are 
capable of raising antibodies, when coupled to a suitable carrier, which antibodies cross- 
react with the native molecule. 

Peptide mimotopes of the above-identified epitopes may be designed for a particular 
purpose by addition, deletion or substitution of elected amino acids. Thus, the peptides of 
the present invention may be modified for the purposes of ease of conjugation to a protein 
carrier. For example, it may be desirable for some chemical conjugation methods to 
include a terminal cysteine to the epitope. In addition it may be desirable for peptides 
conjugated to a protein carrier to include a hydrophobic terminus distal from the 
conjugated terminus of the peptide, such that the free unconjugated end of the peptide 
remains associated with the surface of the carrier protein. This reduces the 
conformational degrees of freedom of the peptide, and thus increases the probability that 
the peptide is presented in a conformation which most closely resembles that of the 
peptide as found in the context of the whole molecule. For example, the peptides may be 
altered to have an N-terminal cysteine and a C-terminal hydrophobic ami dated tail. 
Alternatively, the addition or substitution of a D-stereoisomer form of one or more of the 
amino acids may be performed to create a beneficial derivative, for example to enhance 
stability of the peptide. Those skilled in the art will realise that such modified peptides, or 
mimotopes, could be a wholly or partly non-peptide mimotope wherein the constituent 
residues are not necessarily confined to the 20 naturally occurring amino acids. In 
addition, these may be cyclised by techniques known in the art to constrain the peptide 
into a conformation that closely resembles its shape when the peptide sequence is in the 
context of the whole molecule. A preferred method of cyclising a peptide comprises the 
addition of a pair of cysteine residues to allow the formation of a disulphide bridge. 

3 
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Further, those skilled in the art will realise that mimotopes or immunogens of the present 
invention may be larger than the above-identified epitopes, and as such may comprise the 
sequences disclosed herein. Accordingly, the mimotopes of the present invention may 
consist of addition of N and/or C terminal extensions of a number of other natural 
residues at one or both ends. The peptide mimotopes may also be retro sequences of the 
natural sequences, in that the sequence orientation is reversed; or alternatively the 
sequences may be entirely or at least in part comprised of D-stereo isomer amino acids 
(inverso sequences). Also, the peptide sequences may be retro-inverso in character, in 
that the sequence orientation is reversed and the amino acids are of the D-stereoisomer 
form. Such retro or retro-inverso peptides have the advantage of being non-self, and as 
such may overcome problems of self-tolerance in the immune system. 

Alternatively, peptide mimotopes may be identified using antibodies which are capable 
themselves of binding to the epitopes of the present invention using techniques such as 
phage display technology (EP 0 552 267 Bl). This technique, generates a large number of 
peptide sequences which mimic the structure of the native peptides and are, therefore, 
capable of binding to anti-native peptide antibodies, but may not necessarily themselves 
share significant sequence homology to the native peptide. This approach may have 
significant advantages by allowing the possibility of identifying a peptide with enhanced 
immunogenic properties, or may overcome any potential self-antigen tolerance problems 
which may be associated with the use of the native peptide sequence. Additionally this 
technique allows the identification of a recognition pattern for each native-peptide in 
terms of its shared chemical properties amongst recognised mimotope sequences. 

The covalent coupling of the peptide to the immunogenic carrier can be carried out in a 
manner well known in the art. Thus, for example, for direct covalent coupling it is 
possible to utilise a carbodiimide, glutaraldehyde or (N-[y-maleimidobutyryloxy] 
succinimide ester, utilising common commercially available heterobifunctional linkers 
such as CDAP and SPDP (using manufacturers instructions). After the coupling reaction, 
the immunogen can easily be isolated and purified by means of a dialysis method, a gel 
filtration method, a fractionation method etc. 
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The types of carriers used in the immunogens of the present invention will be readily 
known to the man skilled in the art. The function of the carrier is to provide cytokine help 
in order to help induce an immune response against the peptide. A non-exhaustive list of 
carriers which may be used in the present invention include: Keyhole limpet 
Haemocyanin (KLH), serum albumins such as bovine serum albumin (BSA), inactivated 
bacterial toxins such as tetanus or diptheria toxins (TT and DT), or recombinant 
fragments thereof (for example, Domain 1 of Fragment C of TT, or the translocation 
domain of DT), or the purified protein derivative of tuberculin (PPD). Alternatively the 
mimotopes or epitopes may be directly conjugated to liposome carriers, which may 
additionally comprise immunogens capable of providing T-cell help. Preferably the ratio 
of mimotopes to carrier is in the order of 1:1 to 20:1, and preferably each carrier should 
carry between 3-15 peptides. 

In an embodiment of the invention a preferred carrier is Protein D from Haemophilus 
15 influenzae (EP 0594 610B1). Protein D is an IgD-binding protein from Haemophilus 
influenzae and has been patented by Forsgren (WO 91/18926, granted EP 0 594 610 Bl). 
In some circumstances, for example in recombinant immunogen expression systems it 
may be desirable to use fragments of protein D, for example Protein D 1/3" 1 (comprising 
the N-terminal 100-110 amino acids of protein D (GB 9717953.5)). 

20 

Another preferred method of presenting the peptides of the present invention is in the 
context of a recombinant fusion molecule. For example, EP 0 421 635 B describes the use 
of chimaeric hepadnavirus core antigen particles to present foreign peptide sequences in a 
virus-like particle. As such, immunogens of the present invention may comprise peptides 
25 presented in chimaeric particles consisting of hepatitis B core antigen. Additionally, the 
recombinant fusion proteins may comprise the mimotopes of the present invention and a 
carrier protein, such as NS1 of the influenza virus. For any recombinantly expressed 
protein which forms part of the present invention, the nucleic acid which encodes said 
immunogen also forms an aspect of the present invention. 

30 

Peptides used in the present invention can be readily synthesised by solid phase 
procedures well known in the art. Suitable syntheses may be performed by utilising *T- 
boc" or *T-moc" procedures. Cyclic peptides can be synthesised by the solid phase 
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procedure employing the well-known "F-moc" procedure and polyamide resin in the fully 
automated apparatus. Alternatively, those skilled in the art will know the necessary 
laboratory procedures to perform the process manually. Techniques and procedures for 
solid phase synthesis are described in 'Solid Phase Peptide Synthesis: A Practical 
Approach 1 by E. Atherton and R.C. Sheppard, published by IRL at Oxford University 
Press (1989). Alternatively, the peptides may be produced by recombinant methods, 
including expressing nucleic acid molecules encoding the mimotopes in a bacterial or 
mammalian cell line, followed by purification of the expressed mimotope. Techniques for 
recombinant expression of peptides and proteins are known in the art, and are described 
in Maniatis, T., Fritsch, E.F. and Sambrook et al., Molecular cloning, a laboratory 
manual, 2nd Ed.; Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York 
(1989). 



The polypeptides or immunogenic fragment of the invention may be in the form of the 
"mature" protein or may be a part of a larger protein such as a precursor or a fusion 
protein. It is often advantageous to include an additional amino acid sequence which 
contains secretory or leader sequences, pro-sequences, sequences which aid in 
purification such as multiple histidine residues, or an additional sequence for stability 
during recombinant production. Furthermore, addition of exogenous polypeptide or lipid 
tail or polynucleotide sequences to increase the immunogenic potential of the final 
molecule is also considered. 



In one aspect, the invention relates to genetically engineered soluble fusion proteins 
comprising a polypeptide of the present invention, or a fragment thereof, and various 
portions of the constant regions of heavy or light chains of immunoglobulins of various 
subclasses (IgG, IgM, IgA, IgE). Preferred as an immunoglobulin is the constant part of 
the heavy chain of human IgG, particularly IgGl, where fusion takes place at the hinge 
region. In a particular embodiment, the Fc part can be removed simply by incorporation 
of a cleavage sequence which can be cleaved with blood clotting factor Xa. Furthermore, 
this invention relates to processes for the preparation of these fusion proteins by genetic 
engineering, and to the use thereof for drug screening, diagnosis and therapy. A further 
aspect of the invention also relates to polynucleotides encoding such fusion proteins. 
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Examples of fusion protein technology can be found in International Patent Application 
Nos. W094/29458 and W094/22914. 

The proteins may be chemically conjugated, or expressed as recombinant fusion proteins 
allowing increased levels to be produced in an expression system as compared to non- 
fused protein. The fusion partner may assist in providing T helper epitopes 
(immunological fusion partner), preferably T helper epitopes recognised by humans, or 
assist in expressing the protein (expression enhancer) at higher yields than the native 
recombinant protein. Preferably the fusion partner will be both an immunological fusion 
partner and expression enhancing partner. 

Fusion partners include protein D from Haemophilus influenza B and the non-structural 
protein from influenzae virus, NS 1 (hemagglutinin). Another immunological fusion 
partner is the protein known as LYT A. Preferably the C terminal portion of the molecule 
is used. Lyta is derived from Streptococcus pneumoniae which synthesize an N-acetyl-L- 
alanine amidase, amidase LYTA, (coded by the lytA gene {Gene, 43 (1986) page 265- 
272} an autolysin that specifically degrades certain bonds in the peptidoglycan backbone. 
The C-terminal domain of the LYTA protein is responsible for the affinity to the choline 
or to some choline analogues such as DEAE. This property has been exploited for the 
development of E.coli C-LYTA expressing plasmids useful for expression of fusion 
proteins. Purification of hybrid proteins containing the C-LYTA fragment at its amino 
terminus has been described {Biotechnology: 10, (1992) page 795-798}. It is possible to 
use the repeat portion of the Lyta molecule found in the C terminal end starting at residue 
178, for example residues 188 - 305. 

The present invention also includes variants of the aforementioned polypeptides, that is 
polypeptides that vary from the referents by conservative amino acid substitutions, whereby 
a residue is substituted by another with like characteristics. Typical such substitutions are 
among Ala, Val, Leu and He; among Ser and Thr, among the acidic residues Asp and Glu; 
among Asn and Gin; and among the basic residues Lys and Arg; or aromatic residues Phe 
and Tyr. Particularly preferred are variants in which several, 5-10, 1-5, 1-3, 1-2 or 1 amino 
acids are substituted, deleted, or added in any combination. 
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Polypeptides of the present invention can be prepared in any suitable manner. Such 
polypeptides include isolated naturally occurring polypeptides, recombinantly produced 
polypeptides, synthetically produced polypeptides, or polypeptides produced by a 
combination of these methods. Means for preparing such polypeptides are well understood 
in the art. 

In a further aspect, the present invention relates to CASB641 1 polynucleotides. Such 
polynucleotides include isolated polynucleotides comprising a nucleotide sequence encoding 
a polypeptide which has at least 70% identity, preferably at least 80% identity, more 
preferably at least 90% identity, yet more preferably at least 95% identity, to the amino 
acid sequence of SEQ ID NO:2 OR 4, over the entire length of SEQ ID NO:2 OR 4. In this 
regard, polypeptides which have at least 97% identity are highly preferred, whilst those with 
at least 98-99% identity are more highly preferred, and those with at least 99% identity are 
most highly preferred. Such polynucleotides include a polynucleotide comprising the 
nucleotide sequence contained in SEQ ID NO:l or 3 encoding the polypeptide of SEQ ID 
NO:2 OR 4 respectively. 

Further polynucleotides of the present invention include isolated polynucleotides comprising 
a nucleotide sequence that has at least 70% identity, preferably at least 80% identity, more 
preferably at least 90% identity, yet more preferably at least 95% identity, to a nucleotide 
sequence encoding a polypeptide of SEQ ID NO:2 OR 4, over the entire coding region. In 
this regard, polynucleotides which have at least 97% identity are highly preferred, whilst 
those with at least 98-99% identity are more highly preferred, and those with at least 99% 
identity are most highly preferred. 

Further polynucleotides of the present invention include isolated polynucleotides 
comprising a nucleotide sequence which has at least 70% identity, preferably at least 80% 
identity, more preferably at least 90% identity, yet more preferably at least 95% identity, 
to SEQ ID NO:l or 3 or to the coding sequence of SEQ ID NO:l or 3 over the entire 
length of SEQ ID NO: 1 or 3 or over the entire length of the coding sequence of SEQ ID 
NO: 1 or 3 respectively. In this regard, polynucleotides which have at least 97% identity are 
highly preferred, whilst those with at least 98-99% identiy are more highly preferred, and 
those with at least 99% identity are most highly preferred. Such polynucleotides include a 
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polynucleotide comprising the polynucleotide of SEQ ID NO:l or 3 as well as the 
polynucleotide of SEQ ID NO: 1 or 3 or the coding region of SEQ ID NO:l or 3. Said 
polynucleotide can be inserted in a suitable plasmid or recombinant microrganism vector 
and used for immunization ( see for example Wolff et. al., Science 247:1465-1468 (1990); 
Corr et al., J. Exp. Med. 184:1555-1560 (1996); Doe et. al., Proc. Natl. Acad. Sci. 93:8578- 
8583 (1996)). 

The invention also provides polynucleotides which are complementary to all the above 
described polynucleotides. 

The invention also provides a fragment of a CASB641 1 polynucleotide which when 
administered to a subject has the same immunogenic properties as the polynucleotide of 
SEQ ID NO: 1 or 3. 

The invention also provides a polynucleotide encoding an immunological fragment of a 
CASB641 1 polypeptide as hereinbefore defined. 

The fragments have a level of immunogenic activity of at least about 50%, preferably at 
least about 70% and more preferably at least about 90% of the level of immunogenic 
activity of a polypeptide sequence set forth in SEQ ID NO:2 OR 4 or a polypeptide 
sequence encoded by a polynucleotide sequence set forth in SEQ ID NO: 1 . 

The polypeptide fragments according to the invention preferably comprise at least about 
5, 10, 15, 20, 25, 50, or 100 contiguous amino acids, or more, including all intermediate 
lengths, of a polypeptide composition set forth herein, such as those set forth in SEQ~ID 
NO:2 OR 4, or those encoded by a polynucleotide sequence set forth in a sequence of 
SEQ ID NO: 1. 

The nucleotide sequence of SEQ ID NO:l is a cDNA sequence which comprises a 
polypeptide encoding sequence (nucleotide 349 to 1761) encoding a polypeptide of 460 
amino acids, the polypeptide of SEQ ID NO:2. The nucleotide sequence encoding the 
polypeptide of SEQ ID NO:2 may be identical to the polypeptide encoding sequence 
contained in SEQ ID NO:l or it may be a sequence other than the one contained in SEQ 
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ID NO:l, which, as a result of the redundancy (degeneracy) of the genetic code, also 
encodes the polypeptide of SEQ ID NO:2. The polypeptide of the SEQ ID NO:2 is 
structurally related to other proteins of the LAK-4p family, having homology and/or 
structural similarity with Homo sapiens LAK-4p (GenBank accession BAA24179). 

SEQ ID NO:3 is a cDNA sequence which is an alternative isoform of CASB641 1 SEQ ID 
NO:l, and probably the result of alternative exon splicing. SEQ ID NO:3 comprises a 
polypeptide encoding sequence (nucleotide 382 to 844) encoding a polypeptide of 154 
aminoacids, the polypeptide of SEQ ID NO:4. SEQ ED NO:4 is a truncated form of SEQ ID 
NO:2 polypeptide. The nucleotide sequence encoding the polypeptide of SEQ ID NO:4 
may be identical to the polypeptide encoding sequence contained in SEQ ID NO:3 or it 
may be a sequence other than the one contained in SEQ ID NO:3, which, as a result of the 
redundancy (degeneracy) of the genetic code, also encodes the polypeptide of SEQ ID 
NO:4. 

Preferred polypeptides and polynucleotides of the present invention are expected to have, 
inter alia, similar biological functions/properties to their homologous polypeptides and 
polynucleotides. Furthermore, preferred polypeptides, immunological fragments and 
polynucleotides of the present invention have at least one activity of either SEQ ID NO:l or 
3 or SEQ ID NO:2 OR 4, as appropriate. 

The present invention also relates to partial or other incomplete polynucleotide and 
polypeptide sequences which were first identified prior to the determination of the 
corresponding full length sequences of SEQ ID NO:l or 3 and SEQ ID NO:2 OR 4. 

Accordingly, in a further aspect, the present invention provides for an isolated 
polynucleotide which: 

(a) comprises a nucleotide sequence which has at least 70% identity, preferably at least 
80% identity, more preferably at least 90% identity, yet more preferably at least 95% 
identity, even more preferably at least 97-99% identity to SEQ ID NO:5 and 7 over the 
entire length of SEQ ID NO:5 and 7, respectively; 

(b) has a nucleotide sequence which has at least 70% identity, preferably at least 80% 
identity, more preferably at least 90% identity, yet more preferably at least 95% identity, 

10 
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even more preferably at least 97-99% identity, to SEQ ID NO:l or 3 over the entire 
length of SEQ ID NO:5 and 7; 

(c) the polynucleotide of SEQ ID NO:5 and 7; or 

(d) a nucleotide sequence encoding a polypeptide which has at least 70% identity, 
preferably at least 80% identity, more preferably at least 90% identity, yet more 
preferably at least 95% identity, even more preferably at least 97-99% identity, to the 
amino acid sequence of SEQ ID NO:6 and 8, over the entire length of SEQ ID NO:6 and 8 , 
respectively,; 

as well as the polynucleotide of SEQ ID NO:5 and 7. 

The present invention further provides for a polypeptide which: 

(a) comprises an amino acid sequence which has at least 70% identity, preferably at least 
80% identity, more preferably at least 90% identity, yet more preferably at least 95% 
identity, most preferably at least 97-99% identity, to that of SEQ ED NO:6 OR 8 over the 
entire length of SEQ ID NO:6 and 8; 

(b) has an amino acid sequence which is at least 70% identity, preferably at least 80% 
identity, more preferably at least 90% identity, yet more preferably at least 95% identity, 
most preferably at least 97-99% identity, to the amino acid sequence of SEQ ID NO:6 OR 
8 over the entire length of SEQ ID NO:6 and 8; 

(c) comprises the amino acid of SEQ ID NO:6 and 8; and 

(d) is the polypeptide of SEQ ID NO:6 and 8; 

as well as polypeptides encoded by a polynucleotide comprising the sequence contained 
in SEQ ID NO:5 and 7. 



25 Polynucleotides of the present invention may be obtained, using standard cloning and 
screening techniques, from a cDNA library derived from mRNA in cells of human, 
colorectal tumours, stomach tumors and normal stomach, normal pancreas and pancreas 
tumors, ovarian tumors, lung tumors, and normal prostate, (for example Sambrook et aL, 
Molecular Cloning: A Laboratory Manual, 2 nd Ed., Cold Spring harbor Laboratory Press, 

30 Cold Spring harbor, N.Y. (1989)). Polynucleotides of the invention can also be obtained 
from natural sources such as genomic DNA libraries or can be synthesized using well 
known and commercially available techniques. 
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When polynucleotides of the present invention are used for the recombinant production of 
polypeptides of the present invention, the polynucleotide may include the coding sequence 
for the mature polypeptide, by itself; or the coding sequence for the mature polypeptide in 
reading frame with other coding sequences, such as those encoding a leader or secretory 
sequence, a pre-, or pro- or prepro- protein sequence, or other fusion peptide portions. For 
example, a marker sequence which facilitates purification of the fused polypeptide can be 
encoded. In certain preferred embodiments of this aspect of the invention, the marker 
sequence is a hexa-histidine peptide, as provided in the pQE vector (Qiagen, Inc.) and 
described in Gentz et al y Proc Natl Acad Sci USA (1989) 86:821-824, or is an HA tag. The 
polynucleotide may also contain non-coding 5' and 3' sequences, such as transcribed, non- 
translated sequences, splicing and polyadenylation signals, ribosome binding sites and 
sequences that stabilize mRNA 



Further embodiments of the present invention include polynucleotides encoding polypeptide 
variants which comprise the amino acid sequence of SEQ ID NO:2 OR 4 and in which 
several, for instance from 5 to 10, 1 to 5, 1 to 3, 1 to 2 or 1, amino acid residues are 
substituted, deleted or added, in any combination. 



Polynucleotides which are identical or sufficiently identical to a nucleotide sequence 
contained in SEQ ID NO:l or 3, may be used as hybridization probes for cDNA and 
genomic DNA or as primers for a nucleic acid amplification (PCR) reaction, to isolate full- 
length cDNAs and genomic clones encoding polypeptides of the present invention and to 
isolate cDNA and genomic clones of other genes (including genes encoding paralogs from 
human sources and orthologs and paralogs from species other than human) that have a high 
sequence similarity to SEQ ID NO:l or 3. Typically these nucleotide sequences are 70% 
identical, preferably 80% identical, more preferably 90% identical, most preferably 95% 
identical to that of the referent. The probes or primers will generally comprise at least 1 5 
nucleotides, preferably, at least 30 nucleotides and may have at least 50 nucleotides. 
Particularly preferred probes will have between 30 and 50 nucleotides. Particularly 
preferred primers will have between 20 and 25 nucleotides. In particular, polypeptides or 
polynucleotides derived from sequences from homologous animal origin could be used as 
immunogens to obtain a cross-reactive immune response to the human gene. 
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A polynucleotide encoding a polypeptide of the present invention, including homologs from 
species other than human, may be obtained by a process which comprises the steps of 
screening an appropriate library under stringent hybridization conditions with a labeled 
probe having the sequence ofSEQ ID NO: 1 or 3 or a fragment thereof; and isolating full- 
length cDNA and genomic clones containing said polynucleotide sequence. Such 
hybridization techniques are well known to the skilled artisan. Preferred stringent 
hybridization conditions include overnight incubation at 42°C in a solution comprising: 50% 
formamide, 5xSSC (150mM NaCl, 15mM trisodium citrate), 50 mM sodium phosphate 
(pH7.6), 5x Denhardt's solution, 10 % dextran sulfate, and 20 microgram/ml denatured, 
sheared salmon sperm DNA; followed by washing the filters in 0.1 x SSC at about 65°C. 
Thus the present invention also includes polynucleotides obtainable by screening an 
appropriate library under stingent hybridization conditions with a labeled probe having the 
sequence of SEQ ID NO: 1 or 3 or a fragment thereof. 

The skilled artisan will appreciate that, in many cases, an isolated cDNA sequence will be 
incomplete, in that the region coding for the polypeptide is short at the 5' end of the 
cDNA. 

There are several methods available and well known to those skilled in the art to obtain 
full-length cDNAs, or extend short cDNAs, for example those based on the method of 
Rapid Amplification of cDNA ends (RACE) (see, for example, Frohman et aL, PNAS 
USA 85, 8998-9002, 1988). Recent modifications of the technique, exemplified by the 
Marathon™ technology (Clontech Laboratories Inc.) for example, have significantly 
simplified the search for longer cDNAs. In the Marathon™ technology, cDNAs have 
been prepared from mRNA extracted from a chosen tissue and an 'adaptor 1 sequence 
ligated onto each end. Nucleic acid amplification (PCR) is then carried out to amplify the 
'missing' 5' end of the cDNA using a combination of gene specific and adaptor specific 
oligonucleotide primers. The PCR reaction is then repeated using 'nested' primers, that is, 
primers designed to anneal within the amplified product (typically an adaptor specific 
primer that anneals further 3' in the adaptor sequence and a gene specific primer that 
anneals further 5' in the known gene sequence). The products of this reaction can then be 
analysed by DNA sequencing and a full-length cDNA constructed either by joining the 
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product directly to the existing cDNA to give a complete sequence, or carrying out a 
separate full-length PCR using the new sequence information for the design of the 5' 
primer. 

Recombinant polypeptides of the present invention may be prepared by processes well 
known in the art from genetically engineered host cells comprising expression systems. 
Accordingly, in a further aspect, the present invention relates to an expression system which 
comprises a polynucleotide of the present invention, to host cells which are genetically 
engineered with such expression sytems and to the production of polypeptides of the 
invention by recombinant techniques. Cell-free translation systems can also be employed to 
produce such proteins using RNAs derived from the DNA constructs of the present 
invention. 

For recombinant production, host cells can be genetically engineered to incorporate 
expression systems or portions thereof for polynucleotides of the present invention. 
Introduction of polynucleotides into host cells can be effected by methods described in many 
standard laboratory manuals, such as Davis et aL, Basic Methods in Molecular Biology 
(1986) and Sambrook et aL, Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989). Preferred such methods 
include, for instance, calcium phosphate transfection, DEAE-dextran mediated transfection, 
transvection, microinjection, cationic lipid-mediated transfection, electroporation, 
transduction, scrape loading, ballistic introduction or infection. 

Preferably the proteins of the invention are coexpressed with thioredoxin in trans (TIT). 
Coexpression of thioredoxin in trans versus in cis is preferred to keep antigen free of" 
thioredoxin without the need for protease. Thioredoxin coexpression eases the 
solubilisation of the proteins of the invention. Thioredoxin coexpression has also a 
significant impact on protein purification yield, on purified-protein solubility and quality. 

Representative examples of appropriate hosts include bacterial cells, such as Streptococci, 
Staphylococci, E. coli, Streptomyces and Bacillus subtilis cells; fungal cells, such as yeast 
cells and Aspergillus cells; insect cells such as Drosophila S2 and Spodoptera Sf9 cells; 



14 



rJ« *u,i> ti*xA 4 n.„v ,„i" .A i}„,1». ,.,S' ui *i...p » »J.,«: } ii.,,fi n.> .. 

* 

WO 01/23417 PCTyEPO0/O9SO0 

animal cells such as CHO, COS, HeLa, C127, 3T3, BHK, HEK 293 and Bowes melanoma 
cells; and plant cells. 

A great variety of expression systems can be used, for instance, chromosomal, episomal 
5 and virus-derived systems, e.g., vectors derived from bacterial plasmids, from 

bacteriophage, from transposons, from yeast episomes, from insertion elements, from 
yeast chromosomal elements, from viruses such as baculoviruses, papova viruses, such as 
SV40, vaccinia viruses, adenoviruses, fowl pox viruses, pseudorabies viruses and 
retroviruses, and vectors derived from combinations thereof, such as those derived from 

10 plasmid and bacteriophage genetic elements, such as cosmids and phagemids. The 
expression systems may contain control regions that regulate as well as engender 
expression. Generally, any system or vector which is able to maintain, propagate or 
express a polynucleotide to produce a polypeptide in a host may be used. The appropriate 
nucleotide sequence may be inserted into an expression system by any of a variety of 

15 well-known and routine techniques, such as, for example, those set forth in Sambrook et 
al. 9 Molecular Cloning, A Laboratory Manual (supra). Appropriate secretion signals may 
be incorporated into the desired polypeptide to allow secretion of the translated protein 
into the lumen of the endoplasmic reticulum, the periplasmic space or the extracellular 
environment. These signals may be endogenous to the polypeptide or they may be 

20 heterologous signals. 

The expression system may also be a recombinant live microorganism, such as a virus or 
bacterium. The gene of interest can be inserted into the genome of a live recombinant 
virus or bacterium. Inoculation and in vivo infection with this live vector will lead to in 
25 vivo expression of the antigen and induction of immune responses. 

Therefore, in certain embodiments, polynucleotides encoding immunogenic polypeptides 
of the present invention are introduced into suitable mammalian host cells for expression 
using any of a number of known viral-based systems. In one illustrative embodiment, 
30 retroviruses provide a convenient and effective platform for gene delivery systems. A 
selected nucleotide sequence encoding a polypeptide of the present invention can be 
inserted into a vector and packaged in retroviral particles using techniques known in the 
art. The recombinant virus can then be isolated and delivered to a subject. A number of 
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illustrative retroviral systems have been described (e.g., U.S. Pat. No. 5,219,740; Miller 
and Rosman (1989) BioTechniques 7:980-990; Miller, A. D. (1990) Human Gene 
Therapy 1:5-14; Scarpa et al. (1991) Virology 180:849-852; Burns et al. (1993) Proc. 
Natl. Acad. Sci. USA 90:8033-8037; and Boris-Lawrie and Temin (1993) Cur. Opin. 
Genet. Develop. 3:102-109. 



In addition, a number of illustrative adenovirus-based systems have also been described. 
Unlike retroviruses which integrate into the host genome, adenoviruses persist 
extrachromosomally thus minimizing the risks associated with insertional mutagenesis 
(Haj-Ahmad and Graham (1986) J. Virol. 57:267-274; Bett et al. (1993) J. Virol. 
67:5911-5921; Mittereder et al. (1994) Human Gene Therapy 5:717-729; Seth et al. 
(1994) J. Virol. 68:933-940; Barr et al. (1994) Gene Therapy 1:51-58; Berkner, K. L. 
(1988) BioTechniques 6:616-629; and Rich et al. (1993) Human Gene Therapy 4:461- 
476). 



Various adeno-associated virus (AAV) vector systems have also been developed for 
polynucleotide delivery. AAV vectors can be readily constructed using techniques well 
known in the art. See, e.g., U.S. Pat. Nos. 5,173,414 and 5,139,941; International 
Publication Nos. WO 92/01070 and WO 93/03769; Lebkowski et al. (1988) Molec. Cell. 
Biol. 8:3988-3996; Vincent et al. (1990) Vaccines 90 (Cold Spring Harbor Laboratory 
Press); Carter, B. J. (1992) Current Opinion in Biotechnology 3:533-539; Muzyczka, N. 
(1992) Current Topics in Microbiol, and Immunol. 158:97-129; Kotin, R M. (1994) 
Human Gene Therapy 5:793-801; Shelling and Smith (1994) Gene Therapy 1:165-169; 
and Zhou et al. (1994) J. Exp. Med. 179:1867-1875. 

Additional viral vectors useful for delivering the nucleic acid molecules encoding 
polypeptides of the present invention by gene transfer include those derived from the pox 
family of viruses, such as vaccinia virus and avian poxvirus. By way of example, vaccinia 
virus recombinants expressing the novel molecules can be constructed as follows. The 
DNA encoding a polypeptide is first inserted into an appropriate vector so that it is 
adjacent to a vaccinia promoter and flanking vaccinia DNA sequences, such as the 
sequence encoding thymidine kinase (TK). This vector is then used to transfect cells 
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which are simultaneously infected with vaccinia. Homologous recombination serves to 
insert the vaccinia promoter plus the gene encoding the polypeptide of interest into the 
viral genome. The resulting TK.sup.(-) recombinant can be selected by culturing the cells 
in the presence of 5-bromodeoxyuridine and picking viral plaques resistant thereto. 



A vaccinia-based infection/trans fection system can be conveniently used to provide for 
inducible, transient expression or coexpression of one or more polypeptides described 
herein in host cells of an organism. In this particular system, cells are first infected in 
vitro with a vaccinia virus recombinant that encodes the bacteriophage T7 RNA 

10 polymerase. This polymerase displays exquisite specificity in that it only transcribes 
templates bearing T7 promoters. Following infection, cells are transfected with the 
polynucleotide or polynucleotides of interest, driven by a T7 promoter. The polymerase 
expressed in the cytoplasm from the vaccinia virus recombinant transcribes the 
transfected DNA into RNA which is then translated into polypeptide by the host 

15 translational machinery. The method provides for high level, transient, cytoplasmic 

production of large quantities of RNA and its translation products. See, e.g., Elroy-Stein 
and Moss, Proc. Natl. Acad. Sci. USA (1990) 87:6743-6747; Fuerst et al. Proc. Natl. 
Acad. Sci. USA (1986) 83:8122-8126. 



20 Alternatively, avipoxviruses, such as the fowlpox and canarypox viruses, can also be used 
to deliver the coding sequences of interest. Recombinant avipox viruses, expressing 
immunogens from mammalian pathogens, are known to confer protective immunity when 
administered to non-avian species. The use of an Avipox vector is particularly desirable 
in human and other mammalian species since members of the Avipox genus can only 

25 productively replicate in susceptible avian species and therefore are not infective in 

mammalian cells. Methods for producing recombinant Avipoxviruses are known in the 
art and employ genetic recombination, as described above with respect to the production 
of vaccinia viruses. See, e.g., WO 91/12882; WO 89/03429; and WO 92/03545. 



30 



Any of a number of alphavirus vectors can also be used for delivery of polynucleotide 
compositions of the present invention, such as those vectors described in U.S. Patent Nos. 
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5,843,723; 6,015,686; 6,008,035 and 6,015,694. Certain vectors based on Venezuelan 
Equine Encephalitis (VEE) can also be used, illustrative examples of which can be found 
in U.S. Patent Nos. 5,505,947 and 5,643,576. 

5 Moreover, molecular conjugate vectors, such as the adenovirus chimeric vectors 

described in Michael et al. J. Biol. Chem. (1993) 268:6866-6869 and Wagner et al. Proc. 
Natl. Acad. Sci. USA (1992) 89:6099-6103, can also be used for gene delivery under the 
invention. 

10 Additional illustrative information on these and other known viral-based delivery systems 
can be found, for example, in Fisher-Hoch et al., Proc. NatL Acad. Set USA 5(5:317-321, 
1989; Flexner et al., Ann. N.Y. Acad. Sci. 569:86-103, 1989; Flexner et al., Vaccine 
5:17-21, 1990; U.S. Patent Nos. 4,603,112, 4,769,330, and 5,017,487; WO 89/01973; 
U.S. Patent No. 4,777,127; GB 2,200,651; EP 0,345,242; WO 91/02805; Berkner, 

15 Biotechniques (5:616-627, 1988; Rosenfeld et al., Science 252:431-434, 1991; Kolls et al., 
Proc. NatL Acad. Sci. USA P/:215-219, 1994; Kass-Eisler et aL, Proc. NatL Acad. Sci. 
USA 90:1 1498-1 1502, 1993; Guzman et al., Circulation 55:2838-2848, 1993; and 
Guzman et al., Cir. Res. 75:1202-1207, 1993. 

20 The recombinant live microorganisms described above can be virulent, or attenuated in 
various ways in order to obtain live vaccines. Such live vaccines also form part of the 
invention. 

25 In certain embodiments, a polynucleotide may be integrated into the genome of a target 
cell. This integration may be in the specific location and orientation via homologous 
recombination (gene replacement) or it may be integrated in a random, non-specific 
location (gene augmentation). In yet further embodiments, the polynucleotide may be 
stably maintained in the cell as a separate, episomal segment of DNA. Such 

30 polynucleotide segments or "episomes" encode sequences sufficient to permit 

maintenance and replication independent of or in synchronization with the host cell cycle. 
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The manner in which the expression construct is delivered to a cell and where in the cell 
the polynucleotide remains is dependent on the type of expression construct employed. 

In another embodiment of the invention, a polynucleotide is administered/delivered as 
"naked" DNA, for example as described in Ulmer et ah, Science 259: HAS- 1749, 1993 
and reviewed by Cohen, Science 259:1691-1692, 1993. The uptake of naked DNA may 
be increased by coating the DNA onto biodegradable beads, which are efficiently 
transported into the cells. 

In still another embodiment, a composition of the present invention can be delivered via a 
particle bombardment approach, many of which have been described. In one illustrative 
example, gas-driven particle acceleration can be achieved with devices such as those 
manufactured by Powdeiject Pharmaceuticals PLC (Oxford, UK) and Powderject 
Vaccines Inc. (Madison, WI), some examples of which are described in U.S. Patent Nos. 
5,846,796; 6,010,478; 5,865,796; 5,584,807; and EP Patent No. 0500 799. This approach 
offers a needle-free delivery approach wherein a dry powder formulation of microscopic 
particles, such as polynucleotide or polypeptide particles, are accelerated to high speed 
within a helium gas jet generated by a hand held device, propelling the particles into a 
target tissue of interest. 

In a related embodiment, other devices and methods that may be useful for gas-driven 
needle-less injection of compositions of the present invention include those provided by 
Bioject, Inc. (Portland, OR), some examples of which are described in U.S. Patent Nos. 
4,790,824; 5,064,413; 5,312,335; 5,383,851; 5,399,163; 5,520,639 and 5,993,412. 

Polypeptides of the present invention can be recovered and purified from recombinant cell 
cultures by well-known methods including ammonium sulfate or ethanol precipitation, acid 
extraction, anion or cation exchange chromatography, phosphocellulose chromatography, 
hydrophobic interaction chromatography, affinity chromatography, hydroxylapatite 
chromatography and lectin chromatography. Most preferably, ion metal affinity 
chromatography (IMAC) is employed for purification. Well known techniques for 
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refolding proteins may be employed to regenerate active conformation when the polypeptide 
is denatured during intracellular synthesis, isolation and or purification. 

Another important aspect of the invention relates to a method for inducing , re-inforcing 
or modulating an immunological response in a mammal which comprises inoculating the 
mammal with a fragment or the entire polypeptide or polynucleotide of the invention, 
adequate to produce antibody and/or T cell immune response for prophylaxis or for 
therapeutic treatment of cancer and autoimmune disease and related conditions. Yet 
another aspect of the invention relates to a method of inducing, re-inforcing or 
modulating immunological response in a mammal which comprises, delivering a 
polypeptide of the present invention via a vector or cell directing expression of the 
polynucleotide and coding for the polypeptide in vivo in order to induce such an 
immunological response to produce immune responses for prophylaxis or treatment of 
said mammal from diseases. 

A further aspect of the invention relates to an immunological/vaccine formulation 
(composition) which, when introduced into a mammalian host, induces, re-inforces or 
modulates an immunological response in that mammal to a polypeptide of the present 
invention wherein the composition comprises a polypeptide or polynucleotide of the 
invention or an immunological fragment thereof as herein before defined.The vaccine 
formulation may further comprise a suitable carrier. Since a polypeptide may be broken 
down in the stomach, it is preferably administered parenterally (for instance, 
subcutaneous, intramuscular, intravenous, or intradermal injection). Formulations 
suitable for parenteral administration include aqueous and non-aqueous sterile injection 
solutions which may contain anti-oxidants, buffers, bacteriostats and solutes which render 
the formulation isotonic with the blood of the recipient; and aqueous and non-aqueous 
sterile suspensions which may include suspending agents or thickening agents. The 
formulations may be presented in unit-dose or multi-dose containers, for example, sealed 
ampoules and vials and may be stored in a freeze-dried condition requiring only the 
addition of the sterile liquid carrier immediately prior to use. 

A further aspect of the invention relates to the in vitro induction of immune responses to a 
fragment or the entire polypeptide or polynucleotide of the present invention or a 
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molecule comprising the polypeptide or polynucleotide of the present invention, using 
cells from the immune system of a mammal, and reinfiising these activated immune cells 
of the mammal for the treatment of disease. Activation of the cells from the immune 
system is achieved by in vitro incubation with the entire polypeptide or polynucleotide of 
the present invention or a molecule comprising the polypeptide or polynucleotide of the 
present invention in the presence or absence of various immunomodulator molecules. 
A further aspect of the invention relates to the immunization of a mammal by 
administration of antigen presenting cells modified by in vitro loading with part or the 
entire polypeptide of the present invention or a molecule comprising the polypeptide of 
the present invention and administered in vivo in an immunogenic way. Alternatively, 
antigen presenting cells can be transfected in vitro with a vector containing a fragment or 
the entire polynucleotide of the present invention or a molecule comprising the 
polynucleotide of the present invention, such as to express the corresponding polypeptide, 
and administered in vivo in an immunogenic way. 
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According to another embodiment, the pharmaceutical compositions described herein will 
comprise one or more immunostimulants in addition to the immunogenic polynucleotide, 
polypeptide, antibody, T-cell and/or antigen presenting cell (APC) compositions of this 
invention. An immunostimulant refers to essentially any substance that enhances or 
potentiates an immune response (antibody and/or cell-mediated) to an exogenous antigen. 
One preferred type of immunostimulant comprises an adjuvant. Many adjuvants contain 
a substance designed to protect the antigen from rapid catabolism, such as aluminum 
hydroxide or mineral oil, and a stimulator of immune responses, such as lipid A, 
Bortadella pertussis or Mycobacterium tuberculosis derived proteins. Certain adjuvants 
are commercially available as, for example, Freund's Incomplete Adjuvant and Complete 
Adjuvant (Difco Laboratories, Detroit, MI); Merck Adjuvant 65 (Merck and Company, 
Inc., Rahway, NJ); AS-2 (SmithKline Beecham, Philadelphia, PA); aluminum salts such 
as aluminum hydroxide gel (alum) or aluminum phosphate; salts of calcium, iron or zinc; 
an insoluble suspension of acylated tyrosine; acylated sugars; cationically or anionically 
derivatized polysaccharides; polyphosphazenes; biodegradable microspheres; 
monophosphoryl lipid A and quil A. Cytokines, such as GM-CSF, interleukin-2, -7, -12, 
and other like growth factors, may also be used as adjuvants. 



Within certain embodiments of the invention, the adjuvant composition is preferably one 
that induces an immune response predominantly of the Thl type. High levels of Thl- 
type cytokines {e.g., IFN-y, TNFa, DL-2 and EL- 12) tend to favor the induction of cell 
mediated immune responses to an administered antigen. In contrast, high levels of Th2- 
type cytokines {e.g., IL-4, EL-5, IL-6 and IL-10) tend to favor the induction of humoral 
immune responses. Following application of a vaccine as provided herein, a patient will 
support an immune response that includes Thl - and Th2-type responses. Within a 
preferred embodiment, in which a response is predominantly Thl -type, the level of Thl - 
type cytokines will increase to a greater extent than the level of Th2-type cytokines. The 
levels of these cytokines may be readily assessed using standard assays. For a review of 
the families of cytokines, see Mosmann and Coffinan, Ann. Rev. Immunol 7:145-173, 
1989. 
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Certain preferred adjuvants for eliciting a predominantly Thl-type response include, for 
example, a combination of monophosphoryl lipid A, preferably 3-de-O-acylated 
monophosphoryl lipid A, together with an aluminum salt. MPL® adjuvants are available 
from Corixa Corporation (Seattle, WA; see, for example, US Patent Nos. 4,436,727; 
5 4,877,61 1 ; 4,866,034 and 4,912,094). CpG-containing oligonucleotides (in which the 
CpG dinucleotide is unmethylated) also induce a predominantly Thl response. Such 
oligonucleotides are well known and are described, for example, in WO 96/02555, WO 
99/33488 and U.S. Patent Nos. 6,008,200 and 5,856,462. Immunostimulatory DNA 
sequences are also described, for example, by Sato et al., Science 273:352, 1996. 

10 Another preferred adjuvant comprises a saponin, such as Quil A, or derivatives thereof, 
including QS21 and QS7 (Aquila Biopharmaceuticals Inc., Framingham, MA); Escin; 
Digitonin; or Gypsophila or Chenopodium quinoa saponins . Other preferred 
formulations include more than one saponin in the adjuvant combinations of the present 
invention, for example combinations of at least two of the following group comprising 

15 QS21, QS7, Quil A, P-escin, or digitonin. 



Alternatively the saponin formulations may be combined with vaccine vehicles composed 
of chitosan or other polycationic polymers, polylactide and polylactide-co-glycolide 
particles, poly-N-acetyl glucosamine-based polymer matrix, particles composed of 

20 polysaccharides or chemically modified polysaccharides, liposomes and lipid-based 
particles, particles composed of glycerol monoesters, etc. The saponins may also be 
formulated in the presence of cholesterol to form particulate structures such as liposomes 
or ISCOMs. Furthermore, the saponins may be formulated together with a 
polyoxyethylene ether or ester, in either a non-particulate solution or suspension, or in a 

25 particulate structure such as a paucilamelar liposome or ISCOM. The saponins may also 
be formulated with excipients such as Carbopol R to increase viscosity, or may be 
formulated in a dry powder form with a powder excipient such as lactose. 

In one preferred embodiment, the adjuvant system includes the combination of a 
monophosphoryl lipid A and a saponin derivative, such as the combination of QS21 and 
30 3D-MPL adjuvant, as described in WO 94/00153, or a less reactogenic composition 
where the QS21 is quenched with cholesterol, as described in WO 96/33739. Other 
preferred formulations comprise an oil-in-water emulsion and tocopherol. Another 
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particularly preferred adjuvant formulation employing QS21, 3D-MPL® adjuvant and 
tocopherol in an oil-in-water emulsion is described in WO 95/17210. 

Another enhanced adjuvant system involves the combination of a CpG-containing 
5 oligonucleotide and a saponin derivative particularly the combination of CpG and QS21 
as disclosed in WO 00/09159. Preferably the formulation additionally comprises an oil in 
water emulsion and tocopherol. 

Additional illustrative adjuvants for use in the pharmaceutical compositions of the 
10 invention include Montanide ISA 720 (Seppic, France), SAF (Chiron, California, United 
States), ISCOMS (CSL), MF-59 (Chiron), the SB AS series of adjuvants (e.g., SBAS-2 or 
SBAS-4, available from SmithKline Beecham, Rixensart, Belgium), Detox (Enhanzyn®) 
(Corixa, Hamilton, MT), RC-529 (Corixa, Hamilton, MT) and other aminoalkyl 
glucosaminide 4-phosphates (AGPs), such as those described in pending U.S. Patent 
15 Application Serial Nos. 08/853,826 and 09/074,720, the disclosures of which are 

incorporated herein by reference in their entireties, and polyoxyethylene ether adjuvants 
such as those described in WO 99/52549A1. 

Other preferred adjuvants include adjuvant molecules of the general formula (I): 
HO(CH 2 CH 2 0) n -A-R 

20 Wherein, n is 1-50, A is a bond or -C(O)-, R is C U5 o alkyl or Phenyl Ci. 50 alkyl. 

One embodiment of the present invention consists of a vaccine formulation comprising a 
polyoxyethylene ether of general formula (I), wherein n is between 1 and 50, preferably 
4-24, most preferably 9; the R component is Ci. 50 , preferably C4-C20 alkyl and most 
preferably C X2 alkyl, and A is a bond. The concentration of the polyoxyethylene ethers 
25 should be in the range 0. 1 -20%, preferably from 0.1-1 0%, and most preferably in the 
range 0.1-1%. Preferred polyoxyethylene ethers are selected from the following group: 
polyoxyethylene-9-lauryl ether, polyoxyethylene-9-steoryl ether, polyoxyethylene-8- 
steoryl ether, polyoxyethylene-4-lauryl ether, poIyoxyethylene-35-lauryl ether, and 
polyoxyethylene-23-lauryl ether. Polyoxyethylene ethers such as polyoxyethylene lauryl 
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ether are described in the Merck index (12 th edition: entry 7717). These adjuvant 
molecules are described in WO 99/52549. 



The polyoxyethylene ether according to the general formula (I) above may, if desired, be 
combined with another adjuvant. For example, a preferred adjuvant combination is 
preferably with CpG as described in the pending UK patent application GB 9820956.2. 



Preferably a carrier is also present in the vaccine composition according to the invention. 
The carrier may be an oil in water emulsion, or an aluminium salt, such as aluminium 
phosphate or aluminium hydroxide. 

A preferred oil-in-water emulsion comprises a metabolisible oil, such as squalene, alpha 
tocopherol and Tween 80. In a particularly preferred aspect the antigens in the vaccine 
composition according to the invention are combined with QS21 and 3D-MPL in such an 
emulsion. Additionally the oil in water emulsion may contain span 85 and/or lecithin 
and/or tricaprylin. 

Typically for human administration QS21 and 3D-MPL will be present in a vaccine in the 
range of ljig - 200jxg, such as 10-100|ig, preferably lOfig - 50pig per dose. Typically the 
oil in water will comprise from 2 to 10% squalene, from 2 to 10% alpha tocopherol and 
from 0.3 to 3% tween 80. Preferably the ratio of squalene: alpha tocopherol is equal to 
or less than 1 as this provides a more stable emulsion. Span 85 may also be present at a 
level of 1%. In some cases it may be advantageous that the vaccines of the present 
invention will further contain a stabiliser. 

Non-toxic oil in water emulsions preferably contain a non-toxic oil, e.g. squalane or 
squalene, an emulsifier, e.g. Tween 80, in an aqueous carrier. The aqueous carrier may 
be, for example, phosphate buffered saline. 



25 



,i, a o e o 7" s is , o «-i-o .1. o a . 

WO 01/23417 PC^EP00/09S0O 

A particularly potent adjuvant formulation involving QS21, 3D-MPL and tocopherol in 
an oil in water emulsion is described in WO 95/17210. 

The present invention also provides a polyvalent vaccine composition comprising a vaccine 
5 formulation of the invention in combination with other antigens, in particular antigens useful 
for treating cancers, autoimmune diseases and related conditions. Such a polyvalent vaccine 
composition may include a TH-1 inducing adjuvant as hereinbefore described. 

According to another embodiment of this invention, an immunogenic composition 
10 described herein is delivered to a host via antigen presenting cells (APCs), such as 

dendritic cells, macrophages, B cells, monocytes and other cells that may be engineered 
to be efficient APCs. Such cells may, but need not, be genetically modified to increase 
the capacity for presenting the antigen, to improve activation and/or maintenance of the T 
cell response, to have anti-tumor effects per se and/or to be immunologically compatible 
15 with the receiver (Le. t matched HLA haplotype). APCs may generally be isolated from 
any of a variety of biological fluids and organs, including tumor and peritumoral tissues, 
and may be autologous, allogeneic, syngeneic or xenogeneic cells. 

Certain preferred embodiments of the present invention use dendritic cells or progenitors 
20 thereof as antigen-presenting cells. Dendritic cells are highly potent APCs (Banchereau 
and Steinman, Nature 592:245-251, 1998) and have been shown to be effective as a 
physiological adjuvant for eliciting prophylactic or therapeutic antitumor immunity (see 
Timmerman and Levy, Ann. Rev. Med. 50:507-529, 1999). In general, dendritic cells 
may be identified based on their typical shape (stellate in situ, with marked cytoplasmic 
25 processes (dendrites) visible in vitro), their ability to take up, process and present 
antigens with high efficiency and their ability to activate naive T cell responses. 
Dendritic cells may, of course, be engineered to express specific cell-surface receptors or 
ligands that are not commonly found on dendritic cells in vivo or ex vivo 9 and such 
modified dendritic cells are contemplated by the present invention. As an alternative to 
30 dendritic cells, secreted vesicles antigen-loaded dendritic cells (called exosomes) may be 
used within a vaccine (see Zitvogel et ah, Nature Med. 4:594-600, 1998). 
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Dendritic cells and progenitors may be obtained from peripheral blood, bone marrow, 
tumor-infiltrating cells, peritumoral tissues-infiltrating cells, lymph nodes, spleen, skin, 
umbilical cord blood or any other suitable tissue or fluid. For example, dendritic cells 
may be differentiated ex vivo by adding a combination of cytokines such as GM-CSF, IL- 
5 4, LL-13 and/or TNFcc to cultures of monocytes harvested from peripheral blood. 

Alternatively, CD34 positive cells harvested from peripheral blood, umbilical cord blood 
or bone marrow may be differentiated into dendritic cells by adding to the culture 
medium combinations of GM-CSF, EL-3, TNFcc, CD40 ligand, LPS, flt3 ligand and/or 
other compound(s) that induce differentiation, maturation and proliferation of dendritic 
10 cells. 

Dendritic cells are conveniently categorized as "immature" and "mature" cells, which 
allows a simple way to discriminate between two well characterized phenotypes. 
However, this nomenclature should not be construed to exclude all possible intermediate 
stages of differentiation. Immature dendritic cells are characterized as APC with a high 

15 capacity for antigen uptake and processing, which correlates with the high expression of 
Fey receptor and mannose receptor. The mature phenotype is typically characterized by a 
lower expression of these markers, but a high expression of cell surface molecules 
responsible for T cell activation such as class I and class II MHC, adhesion molecules 
(e.g., CD54 and CD1 1) and costimulatory molecules (e.g., CD40, CD80, CD86 and 4- 

20 IBB). 

APCs may generally be transfected with a polynucleotide of the invention (or portion or 
other variant thereof) such that the encoded polypeptide, or an immunogenic portion 
thereof, is expressed on the cell surface. Such transfection may take place ex vivo, and a 

25 pharmaceutical composition comprising such transfected cells may then be used for 
therapeutic purposes, as described herein. Alternatively, a gene delivery vehicle that 
targets a dendritic or other antigen presenting cell may be administered to a patient, 
resulting in transfection that occurs in vivo. In vivo and ex vivo transfection of dendritic 
cells, for example, may generally be performed using any methods known in the art, such 

30 as those described in WO 97/24447, or the gene gun approach described by Mahvi et al., 
Immunology and cell Biology 75:456-460, 1997. Antigen loading of dendritic cells may 
be achieved by incubating dendritic cells or progenitor cells with the tumor polypeptide, 
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DNA (naked or within a plasmid vector) or RNA; or with antigen-expressing 
recombinant bacterium or viruses (e.g., vaccinia, fowlpox, adenovirus or lenti virus 
vectors). Prior to loading, the polypeptide may be covalently conjugated to an 
immunological partner that provides T cell help (e.g., a carrier molecule). Alternatively, 
5 a dendritic cell may be pulsed with a non-conjugated immunological partner, separately 
or in the presence of the polypeptide. 

While any suitable earner known to those of ordinary skill in the art may be employed in 
the pharmaceutical compositions of this invention, the type of carrier will typically vary 
10 depending on the mode of administration. Compositions of the present invention may be 
formulated for any appropriate manner of administration, including for example, topical, 
oral, nasal, mucosal, intravenous, intracranial, intraperitoneal, subcutaneous and 
intramuscular administration. 

Carriers for use within such pharmaceutical compositions are biocompatible, and may 
15 also be biodegradable. In certain embodiments, the formulation preferably provides a 
relatively constant level of active component release. In other embodiments, however, a 
more rapid rate of release immediately upon administration may be desired. The 
formulation of such compositions is well within the level of ordinary skill in the art using 
known techniques. Illustrative carriers useful in this regard include microparticles of 
20 poly(lactide-co-glycolide), polyacrylate, latex, starch, cellulose, dextran and the like. 
Other illustrative delayed-release carriers include supramolecular biovectors, which 
comprise a non-liquid hydrophilic core (e.g., a cross-linked polysaccharide or 
oligosaccharide) and, optionally, an external layer comprising an amphiphilic compound, 
such as a phospholipid (see e.g, U.S. Patent No. 5,151,254 and PCT applications WO 
25 94/20078, WO/94/23701 and WO 96/06638). The amount of active compound contained 
within a sustained release formulation depends upon the site of implantation, the rate and 
expected duration of release and the nature of the condition to be treated or prevented. 

In another illustrative embodiment, biodegradable microspheres (e.g., polylactate 
30 polyglycolate) are employed as carriers for the compositions of this invention. Suitable 
biodegradable microspheres are disclosed, for example, in U.S. Patent Nos. 4,897,268; 
5,075,109; 5,928,647; 5,811,128; 5,820,883; 5,853,763; 5,814,344, 5,407,609 and 
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5,942,252. Modified hepatitis B core protein carrier systems- such as described in 
WO/99 40934, and references cited therein, will also be useful for many applications. 
Another illustrative carrier/delivery system employs a carrier comprising particulate- 
protein complexes, such as those described in U.S. Patent No. 5,928,647, which are 
5 capable of inducing a class I-restricted cytotoxic T lymphocyte responses in a host. 

The pharmaceutical compositions of the invention will often further comprise one or 
more buffers (e.g., neutral buffered saline or phosphate buffered saline), carbohydrates 
{e.g., glucose, mannose, sucrose or dextrans), mannitol, proteins, polypeptides or amino 
acids such as glycine, antioxidants, bacteriostats, chelating agents such as EDTA or 
10 glutathione, adjuvants (e.g., aluminum hydroxide), solutes that render the formulation 

isotonic, hypotonic or weakly hypertonic with the blood of a recipient, suspending agents, 
thickening agents and/or preservatives. Alternatively, compositions of the present 
invention may be formulated as a lyophilizate. 

15 The pharmaceutical compositions described herein may be presented in unit-dose or 
multi-dose containers, such as sealed ampoules or vials. Such containers are typically 
sealed in such a way to preserve the sterility and stability of the formulation until use. In 
general, formulations may be stored as suspensions, solutions or emulsions in oily or 
aqueous vehicles. Alternatively, a pharmaceutical composition may be stored in a freeze- 

20 dried condition requiring only the addition of a sterile liquid carrier immediately prior to 
use. 

The development of suitable dosing and treatment regimens for using the particular 
compositions described herein in a variety of treatment regimens, including e.g., oral, 
25 parenteral, intravenous, intranasal, and intramuscular administration and formulation, is 
well known in the art, some of which are briefly discussed below for general purposes of 
illustration. 

In certain applications, the pharmaceutical compositions disclosed herein may be 
30 delivered via oral administration to an animal. As such, these compositions may be 
formulated with an inert diluent or with an assimilable edible carrier, or they may be 
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enclosed in hard- or soft-shell gelatin capsule, or they may be compressed into tablets, or 
they may be incorporated directly with the food of the diet. 



The active compounds may even be incorporated with excipients and used in the form of 
5 ingestible tablets, buccal tables, troches, capsules, elixirs, suspensions, syrups, wafers, 
and the like (see, for example, Mathiowitz et aL 9 Nature 1997 Mar 27;386(6623):410-4; 
Hwang et aL 7 Crit Rev Ther Drug Carrier Syst 1998;15(3):243-84; U. S. Patent 
5,641,515; U. S. Patent 5,580,579 and IL S. Patent 5,792,451). Tablets, troches, pills, 
capsules and the like may also contain any of a variety of additional components, for 

10 example, a binder, such as gum tragacanth, acacia, cornstarch, or gelatin; excipients, such 
as dicalcium phosphate; a disintegrating agent, such as com starch, potato starch, alginic 
acid and the like; a lubricant, such as magnesium stearate; and a sweetening agent, such 
as sucrose, lactose or saccharin may be added or a flavoring agent, such as peppermint, 
oil of wintergreen, or cherry flavoring. When the dosage unit form is a capsule, it may 

15 contain, in addition to materials of the above type, a liquid carrier. Various other 

materials may be present as coatings or to otherwise modify the physical form of the 
dosage unit. For instance, tablets, pills, or capsules may be coated with shellac, sugar, or 
both. Of course, any material used in preparing any dosage unit form should be 
pharmaceutical^ pure and substantially non-toxic in the amounts employed. In addition, 

20 the active compounds may be incorporated into sustained-release preparation and 
formulations. 



Typically, these formulations will contain at least about 0.1% of the active compound or 
more, although the percentage of the active ingredient(s) may, of course, be varied and 

25 may conveniently be between about 1 or 2% and about 60% or 70% or more of the 

weight or volume of the total formulation. Naturally, the amount of active compound(s) 
in each therapeutically useful composition may be prepared is such a way that a suitable 
dosage will be obtained in any given unit dose of the compound. Factors such as 
solubility, bioavailability, biological half-life, route of administration, product shelf life, 

30 as well as other pharmacological considerations will be contemplated by one skilled in 
the art of preparing such pharmaceutical formulations, and as such, a variety of dosages 
and treatment regimens may be desirable. 
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For oral administration the compositions of the present invention may alternatively be 
incorporated with one or more excipients in the form of a mouthwash, dentifrice, buccal 
tablet, oral spray, or sublingual orally-administered formulation. Alternatively, the active 
ingredient may be incorporated into an oral solution such as one containing sodium 
borate, glycerin and potassium bicarbonate, or dispersed in a dentifrice, or added in a 
therapeutically-effective amount to a composition that may include water, binders, 
abrasives, flavoring agents, foaming agents, and humectants. Alternatively the 
compositions may be fashioned into a tablet or solution form that may be placed under 
the tongue or otherwise dissolved in the mouth. 

In certain circumstances it will be desirable to deliver the pharmaceutical compositions 
disclosed herein parenterally, intravenously, intramuscularly, or even intraperitoneal^. 
Such approaches are well known to the skilled artisan, some of which are further 
described, for example, in U. S. Patent 5,543,158; U. S. Patent 5,641,515 and U. S. Patent 
5,399,363. In certain embodiments, solutions of the active compounds as free base or 
pharmacologically acceptable salts may be prepared in water suitably mixed with a 
surfactant, such as hydroxypropylcellulose. Dispersions may also be prepared in 
glycerol, liquid polyethylene glycols, and mixtures thereof and in oils. Under ordinary 
conditions of storage and use, these preparations generally will contain a preservative to 
prevent the growth of microorganisms. 

Illustrative pharmaceutical forms suitable for injectable use include sterile aqueous 
solutions or dispersions and sterile powders for the extemporaneous preparation of sterile 
injectable solutions or dispersions (for example, see U. S. Patent 5,466,468). In all cases 
the form must be sterile and must be fluid to the extent that easy syringability exists. It 
must be stable under the conditions of manufacture and storage and must be preserved 
against the contaminating action of microorganisms, such as bacteria and fungi. The 
carrier can be a solvent or dispersion medium containing, for example, water, ethanol, 
polyol (e.g., glycerol, propylene glycol, and liquid polyethylene glycol, and the like), 
suitable mixtures thereof, and/or vegetable oils. Proper fluidity may be maintained, for 
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example, by the use of a coating, such as lecithin, by the maintenance of the required 
particle size in the case of dispersion and/or by the use of surfactants. The prevention of 
the action of microorganisms can be facilitated by various antibacterial and antifungal 
agents, for example, parabens, chlorobutanol, phenol, sorbic acid, thimerosal, and the 
like. In many cases, it will be preferable to include isotonic agents, for example, sugars 
or sodium chloride. Prolonged absorption of the injectable compositions can be brought 
about by the use in the compositions of agents delaying absorption, for example, 
aluminum monostearate and gelatin. 

In one embodiment, for parenteral administration in an aqueous solution, the solution 
should be suitably buffered if necessary and the liquid diluent first rendered isotonic with 
sufficient saline or glucose. These particular aqueous solutions are especially suitable for 
intravenous, intramuscular, subcutaneous and intraperitoneal administration. In this 
connection, a sterile aqueous medium that can be employed will be known to those of 
skill in the art in light of the present disclosure. For example, one dosage may be 
dissolved in 1 ml of isotonic NaCl solution and either added to 1000 ml of 
hypodermoclysis fluid or injected at the proposed site of infusion, (see for example, 
"Remington's Pharmaceutical Sciences" 15th Edition, pages 1035-1038 and 1570-1580). 
Some variation in dosage will necessarily occur depending on the condition of the subject 
being treated. Moreover, for human administration, preparations will of course 
preferably meet sterility, pyrogenicity, and the general safety and purity standards as 
required by FDA Office of Biologies standards. 

In another embodiment of the invention, the compositions disclosed herein may be 
formulated in a neutral or salt form. Illustrative pharmaceutically-acceptable salts include 
the acid addition salts (formed with the free amino groups of the protein) and which are 
formed with inorganic acids such as, for example, hydrochloric or phosphoric acids, or 
such organic acids as acetic, oxalic, tartaric, mandelic, and the like. Salts formed with the 
free carboxyl groups can also be derived from inorganic bases such as, for example, 
sodium, potassium, ammonium, calcium, or ferric hydroxides, and such organic bases as 
isopropylamine, trimethylamine, histidine, procaine and the like. Upon formulation, 
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solutions will be administered in a manner compatible with the dosage formulation and in 
such amount as is therapeutically effective. 

The carriers can further comprise any and all solvents, dispersion media, vehicles, 
coatings, diluents, antibacterial and antifungal agents, isotonic and absorption delaying 
agents, buffers, carrier solutions, suspensions, colloids, and the like. The use of such 
media and agents for pharmaceutical active substances is well known in the art. Except 
insofar as any conventional media or agent is incompatible with the active ingredient, its 
use in the therapeutic compositions is contemplated. Supplementary active ingredients 
can also be incorporated into the compositions. The phrase 

"pharmaceutically-acceptable" refers to molecular entities and compositions that do not 
produce an allergic or similar untoward reaction when administered to a human. 

In certain embodiments, the pharmaceutical compositions may be delivered by intranasal 
sprays, inhalation, and/or other aerosol delivery vehicles. Methods for delivering genes, 
nucleic acids, and peptide compositions directly to the lungs via nasal aerosol sprays has 
been described, e.g., in U. S. Patent 5,756,353 and U. S. Patent 5,804,212. Likewise, the 
delivery of drugs using intranasal microparticle resins (Takenaga et al. 9 J Controlled 
Release 1998 Mar 2;52(l-2):81-7) and lysophosphatidyl-glycerol compounds (U. S. 
Patent 5,725,871) are also well-known in the pharmaceutical arts. Likewise, illustrative 
transmucosal drug delivery in the form of a polytetrafluoroetheylene support matrix is 
described in U. S. Patent 5,780,045. 

In certain embodiments, liposomes, nanocapsules, microparticles, lipid particles, vesicles, 
and the like, are used for the introduction of the compositions of the present invention 
into suitable host cells/organisms. In particular, the compositions of the present invention 
may be formulated for delivery either encapsulated in a lipid particle, a liposome, a 
vesicle, a nanosphere, or a nanoparticle or the like. Alternatively, compositions of the 
present invention can be bound, either covalently or non-covalently, to the surface of 
such carrier vehicles. 
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The formation and use of liposome and liposome-like preparations as potential drug 
carriers is generally known to those of skill in the art (see for example, Lasic, Trends 
Biotechnol 1998 Jul; 16(7): 3 07-21; Takakura, Nippon Rinsho 1998 Mar;56(3):691-5; 
Chandran et aU Indian J Exp Biol. 1997 Aug;35(8):801-9; Margalit, Crit Rev Ther Drug 
Carrier Syst. 1995;12(2-3):233-61; U.S. Patent 5,567,434; U.S. Patent 5,552,157; U.S. 
Patent 5,565,213; U.S. Patent 5,738,868 and U.S. Patent 5,795,587, each specifically 
incorporated herein by reference in its entirety). 

Liposomes have been used successfully with a number of cell types that are normally 
difficult to transfect by other procedures, including T cell suspensions, primaiy 
hepatocyte cultures and PC 12 cells (Renneisen et aL, J Biol Chem. 1990 Sep 
25;265(27):16337-42; Muller et aL, DNA Cell Biol. 1990 Apr;9(3):221-9). In addition, 
liposomes are free of the DNA length constraints that are typical of viral-based delivery 
systems. Liposomes have been used effectively to introduce genes, various drugs, 
radiotherapeutic agents, enzymes, viruses, transcription factors, allosteric effectors and 
the like, into a variety of cultured cell lines and animals. Furthermore, he use of 
liposomes does not appear to be associated with autoimmune responses or unacceptable 
toxicity after systemic delivery. 

In certain embodiments, liposomes are formed from phospholipids that are dispersed in 
an aqueous medium and spontaneously form multilamellar concentric bilayer vesicles 
(also termed multilamellar vesicles (MLVs). 

Alternatively, in other embodiments, the invention provides for pharmaceutically- 
acceptable nanocapsule formulations of the compositions of the present invention. 
Nanocapsules can generally entrap compounds in a stable and reproducible way (see, for 
example, Quintanar-Guerrero et aL, Drug Dev Ind Pharm. 1998 Dec;24(12):l 1 13-28). 
To avoid side effects due to intracellular polymeric overloading, such ultrafine particles 
(sized around 0.1 jim) may be designed using polymers able to be degraded in vivo. Such 
particles can be made as described, for example, by Couvreur et aL, Crit Rev Ther Drug 
Carrier Syst. 1988;5(l):l-20; zur Muhlen et aL, Eur J Pharm Biopharm. 1998 
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Mar;45(2): 149-55; Zambaux et al J Controlled Release. 1998 Jan 2;50(l-3):31-40; and 
U. S. Patent 5,145,684. 

This invention also relates to the use of polynucleotides, in the form of primers derived from 
the polynucleotides of the present invention, and of polypeptides, in the form of antibodies 
or reagents specific for the polypeptide of the present invention, as diagnostic reagents. 

The identification of genetic or biochemical markers in blood or tissues that will enable the 
detection of very early changes along the carcinogenesis pathway will help in determining 
the best treatment for the patient. Surrogate tumour markers, such as polynucleotide 
expression, can be used to diagnose different forms and states of cancer. The identification 
of expression levels of the polynucleotides of the invention will be useful in both the 
staging of the cancerous disorder and grading the nature of the cancerous tissue. The staging 
process monitors the advancement of the cancer and is determined on the presence or 
absence of malignant tissue in the areas biopsied. The polynucleotides of the invention can 
help to perfect the staging process by identifying markers for the aggresivity of a cancer, for 
example the presence in different areas of the body. The grading of the cancer describes 
how closely a tumour resembles normal tissue of its same type and is assessed by its cell 
morphology and other markers of differentiation. The polynucleotides of the invention can 
be useful in determining the tumour grade as they can help in the determination of the 
differentiation status of the cells of a tumour. 

The diagnostic assays offer a process for diagnosing or determining a susceptibility to 
cancers, autoimmune disease and related conditions through diagnosis by methods 
comprising determining from a sample derived from a subject an abnormally decreased or 
increased level of polypeptide or mRNA. This method of diagnosis is known as 
differential expression. The expression of a particular gene is compared between a 
diseased tissue and a normal tissue. A difference between the polynucleotide-related 
gene, mRNA, or protein in the two tissues is compared, for example in molecular weight, 
amino acid or nucleotide sequence, or relative abundance, indicates a change in the gene, 
or a gene which regulates it, in the tissue of the human that was suspected of being 
diseased. 
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Decreased or increased expression can be measured at the RNA level. PolyA RNA is 
first isolated from the two tissues and the detection of mRNA encoded by a gene 
corresponding to a differentially expressed polynucleotide of the invention can be 
detected by, for example, in situ hybridization in tissue sections, reverse trascriptase- 
PCR, using Northern blots containing poly A+ mRNA, or any other direct or inderect 
RNA detection method. An increased or decreased expression of a given RNA in a 
diseased tissue compared to a normal tissue suggests that the transcript and/or the expressed 
protein has a role in the disease. Thus detection of a higher or lower level of mRNA 
corresponding to SEQ ID NO 1 or 3 relative to normal level is indicative of the presence 
of cancer in the patient. 

mRNA expression levels in a sample can be determined by generation of a library of 
expressed sequence tags (ESTs) from the sample. The relative representation of ESTs in 
the library can be used to assess the relative representation of the gene transcript in the 
starting sample. The EST analysis of the test can then be compared to the EST analysis 
of a reference sample to determine the relative expression levels of the polynucleotide of 
interest* 

Other mRNA analyses can be carried out using serial analysis of gene expression (SAGE) 
methodology (Velculescu et. AI. Science (1995) 270:484) , differential display 
methodology (For example, US 5,776,683) or hybridization analysis which relies on the 
specificity of nucleotide interactions. 

Alternatively, the comparison could be made at the protein level. The protein sizes in the 
two tissues may be compared using antibodies to detect polypeptides in Western blots of 
protein extracts from the two tissues. Expression levels and subcellular localization may 
also be detected immunologically using antibodies to the corresponding protein. Further 
assay techniques that can be used to determine levels of a protein, such as a polypeptide of 
the present invention, in a sample derived from a host are well-known to those of skill in the 
art. A raised or decreased level of polypeptide expression in the diseased tissue compared 
with the same protein expression level in the normal tissue indicates that the expressed 
protein may be involved in the disease. 
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In the assays of the present invention, the diagnosis can be determined by detection of gene 
product expression levels encoded by at least one sequence set forth in SEQ ID NOS: 1 or 3. 
A comparison of the mRNA or protein levels in a diseased versus normal tissue may also be 
used to follow the progression or remission of a disease. 

A large number of polynucleotide sequences in a sample can be assayed using 
polynucleotide arrays. These can be used to examine differential expression of genes and to 
determine gene function. For example, arrays of the polynucleotide sequences SEQ ID NO: 
1 or 3 can be used to determine if any of the polynucleotides are differentially expressed 
between a normal and cancer cell. In one embodiment of the invention, an array of 
oligonucleotides probes comprising the SEQ ID NO: 1 or 3 nucleotide sequence or 
fragments thereof can be constructed to conduct efficient screening of e.g., genetic 
mutations. Array technology methods are well known and have general applicability and 
can be used to address a variety of questions in molecular genetics including gene 
expression, genetic linkage, and genetic variability (see for example: M.Chee et al., Science, 
Vol 274, pp 610-613 (1996)). 

'"Diagnosis" as used herein includes determination of a subject's susceptibility to a 
disease, determination as to whether a subject presently has the disease, and also the 
prognosis of a subject affected by the disease. 

The present invention, further relates to a diagnostic kit for performing a diagnostic assay 
which comprises: 

(a) a polynucleotide of the present invention, preferably the nucleotide sequence of SEQ 
ID NO: 1 or 3, or a fragment thereof; 

(b) a nucleotide sequence complementary to that of (a); 

(c) a polypeptide of the present invention, preferably the polypeptide of SEQ ID NO: 2 or 
4, or a fragment thereof; or 

(d) an antibody to a polypeptide of the present invention, preferably to the polypeptide of 
SEQ ID NO:2 OR 4. 
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The nucleotide sequences of the present invention are also valuable for chromosomal 
localisation. The sequence is specifically targeted to, and can hybridize with, a particular 
location on an individual human chromosome. The mapping of relevant sequences to 
chromosomes according to the present invention is an important first step in correlating 
those sequences with gene associated disease. Once a sequence has been mapped to a 
precise chromosomal location, the physical position of the sequence on the chromosome can 
be correlated with genetic map data. Such data are found in, for example, V. McKusick, 
Mendelian Inheritance in Man (available on-line through Johns Hopkins University Welch 
Medical Library). The relationship between genes and diseases that have been mapped to 
the same chromosomal region are then identified through linkage analysis (coinheritance of 
physically adjacent genes).The differences in the cDNA or genomic sequence between 
affected and unaffected individuals can also be determined. 



The polypeptides of the invention or their fragments or analogs thereof, or cells expressing 
them, can also be used as immunogens to produce antibodies immunospecific for 
polypeptides of the present invention- The term "immunospecific" means that the antibodies 
have substantially greater affinity for the polypeptides of the invention than their affinity for 
other related polypeptides in the prior art. 



In a further aspect the invention provides an antibody immunospecific for a polypeptide 
according to the invention or an immunological fragment thereof as hereinbefore defined. 
Preferably the antibody is a monoclonal antibody 



Antibodies generated against polypeptides of the present invention may be obtained by 
administering the polypeptides or epitope-bearing fragments, analogs or cells to an animal, 
preferably a non-human animal, using routine protocols. For preparation of monoclonal 
antibodies, any technique which provides antibodies produced by continuous cell line 
cultures can be used. Examples include the hybridoma technique (Kohler, G. and Milstein, 
C, Nature (1975) 256:495-497), the trioma technique, the human B-cell hybridoma 
technique (Kozbor et al 9 Immunology Today (1983) 4:72) and the EBV-hybridoma 
technique (Cole et al y Monoclonal Antibodies and Cancer Therapy, 77-96, Alan R. Liss, 
Inc., 1985). 
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Techniques for the production of single chain antibodies, such as those described in U.S. 
Patent No. 4,946,778, can also be adapted to produce single chain antibodies to polypeptides 
of this invention. Also, transgenic mice, or other organisms, including other mammals, may 
be used to express humanized antibodies. 

The above-described antibodies may be employed to isolate or to identify clones expressing 
the polypeptide or to purify the polypeptides by affinity chromatography. 
The antibody of the invention may also be employed to prevent or treat cancer, particularly 
ovarian and colon cancer, autoimmune disease and related conditions. 

Another aspect of the invention relates to a method for inducing or modulating an 
immunological response in a mammal which comprises inoculating the mammal with a 
polypeptide of the present invention, adequate to produce antibody and/or T cell immune 
response to protect or ameliorate the symptoms or progression of the disease. Yet 
another aspect of the invention relates to a method of inducing or modulating 
immunological response in a mammal which comprises, delivering a polypeptide of the 
present invention via a vector directing expression of the polynucleotide and coding for 
the polypeptide in vivo in order to induce such an immunological response to produce 
antibody to protect said animal from diseases. 

It will be appreciated that the present invention therefore provides a method of treating 
abnormal conditions such as, for instance, cancer and autoimmune diseases, in particular, 
ovarian and colon cancer, related to either a presence of, an excess of, or an under- 
expression of, CASB6411 polypeptide activity. 

The present invention further provides for a method of screening compounds to identify 
those which stimulate or which inhibit the function of the CASB641 1 polypeptide. In 
general, agonists or antagonists may be employed for therapeutic and prophylactic purposes 
for such diseases as hereinbefore mentioned. Compounds may be identified from a variety 
of sources, for example, cells, cell-free preparations, chemical libraries, and natural product 
mixtures. Such agonists, antagonists or inhibitors so-identified may be natural or modified 
substrates, ligands, receptors, enzymes, etc., as the case may be, of the polypeptide; or may 
be structural or functional mimetics thereof (see Coligan et aL> Current Protocols in 
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Immunology l(2):Chapter 5 (1991)). Screening methods will be known to those skilled in 
the art. Further screening methods may be found in for example D. Bennett et aL 9 J Mol 
Recognition, 8:52-58 (1995); and K. Johanson et aL, J Biol Chem, 270(1 6):9459-9471 
(1995) and references therein. 



Thus the invention provides a method for screening to identify compounds which stimulate 
or which inhibit the function of the polypeptide of the invention which comprises a method 
selected from the group consisting of: 

(a) measuring the binding of a candidate compound to the polypeptide (or to the cells or 
membranes bearing the polypeptide) or a fusion protein thereof by means of a label 
directly or indirectly associated with the candidate compound; 

(b) measuring the binding of a candidate compound to the polypeptide (or to the cells or 
membranes bearing the polypeptide) or a fusion protein thereof in the presense of a 
labeled competitior; 

(c) testing whether the candidate compound results in a signal generated by activation or 
inhibition of the polypeptide, using detection systems appropriate to the cells or cell 
membranes bearing the polypeptide; 

(d) mixing a candidate compound with a solution containing a polypeptide of claim 1 , to 
form a mixture, measuring activity of the polypeptide in the mixture, and comparing the 
activity of the mixture to a standard; or 

(e) detecting the effect of a candidate compound on the production of mRNA encoding 
said polypeptide and said polypeptide in cells, using for instance, an ELISA assay. 

The polypeptide of the invention may be used to identify membrane bound or soluble 
receptors, if any, through standard receptor binding techniques known in the art. Well 
known screening methods may also be used to identify agonists and antagonists of the 
polypeptide of the invention which compete with the binding of the polypeptide of the 
invention to its receptors, if any. 

Thus, in another aspect, the present invention relates to a screening kit for identifying 
agonists, antagonists, ligands, receptors, substrates, enzymes, etc. for polypeptides of the 
present invention; or compounds which decrease or enhance the production of such 
polypeptides, which comprises: 
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(a) a polypeptide of the present invention; 

(b) a recombinant cell expressing a polypeptide of the present invention; 

(c) a cell membrane expressing a polypeptide of the present invention; or 

(d) antibody to a polypeptide of the present invention; 

5 which polypeptide is preferably that of SEQ ID NO:2 OR 4. 

It will be readily appreciated by the skilled artisan that a polypeptide of the present 
invention may also be used in a method for the structure-based design of an agonist, 
antagonist or inhibitor of the polypeptide, by: 
10 (a) determining in the first instance the three-dimensional structure of the polypeptide; 

(b) deducing the three-dimensional structure for the likely reactive or binding site(s) of 
an agonist, antagonist or inhibitor; 

(c) synthesing candidate compounds that are predicted to bind to or react with the 
deduced binding or reactive site; and 

15 (d) testing whether the candidate compounds are indeed agonists, antagonists or 
inhibitors. 

Gene therapy may also be employed to effect the endogenous production of CASB641 1 
polypeptide by the relevant cells in the subject. For an overview of gene therapy, see 
20 Chapter 20, Gene Therapy and other Molecular Genetic-based Therapeutic Approaches, 
(and references cited therein) in Human Molecular Genetics, T Strachan and A P Read, 
BIOS Scientific Publishers Ltd (1996). 

Vaccine preparation is generally described in Pharmaceutical Biotechnology, Vol.61 
25 Vaccine Design - the subunit and adjuvant approach, edited by Powell and Newman, 

Plenum Press, 1995. New Trends and Developments in Vaccines, edited by Voller et aL, 
University Park Press, Baltimore, Maryland, U.S.A. 1978. Encapsulation within 
liposomes is described, for example, by Fullerton, U.S. Patent 4,235,877. Conjugation of 
proteins to macromolecules is disclosed, for example, by Likhite, U.S. Patent 4,372,945 
30 and by Armor et aL, U.S. Patent 4,474,757. 

The amount of protein in each vaccine dose is selected as an amount which induces an 
immunoprotective response without significant, adverse side effects in typical vaccinees. 
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Such amount will vary depending upon which specific immunogen is employed. 
Generally, it is expected that each dose will comprise 1-lOOOfxg of protein, preferably 
2-lOOp.g, most preferably 4-40jig. An optimal amount for a particular vaccine can be 
ascertained by standard studies involving observation of antibody titres and other 
5 responses in subjects. Following an initial vaccination, subjects may receive a boost in 
about 4 weeks. 

"Isolated" means altered "by the hand of man" from the natural state. If an "isolated" 
composition or substance occurs in nature, it has been changed or removed from its 
10 original environment, or both. For example, a polynucleotide or a polypeptide naturally 
present in a living animal is not "isolated," but the same polynucleotide or polypeptide 
separated from the coexisting materials of its natural state is "isolated", as the term is 
employed herein. 

15 "Polynucleotide" generally refers to any polyribonucleotide or polydeoxribonucleotide, 
which may be unmodified RNA or DNA or modified RNA or DNA including single and 
double stranded regions. 

"Variant" refers to a polynucleotide or polypeptide that differs from a reference 
20 polynucleotide or polypeptide, but retains essential properties. A typical variant of a 
polynucleotide differs in nucleotide sequence from another, reference polynucleotide. 
Changes in the nucleotide sequence of the variant may or may not alter the amino acid 
sequence of a polypeptide encoded by the reference polynucleotide. Nucleotide changes 
may result in amino acid substitutions, additions, deletions, fusions and truncations in the 
25 polypeptide encoded by the reference sequence, as discussed below. A typical variant of 
a polypeptide differs in amino acid sequence from another, reference polypeptide. 
Generally, differences are limited so that the sequences of the reference polypeptide and 
the variant are closely similar overall and, in many regions, identical. A variant and 
reference polypeptide may differ in amino acid sequence by one or more substitutions, 
30 additions, deletions in any combination. A substituted or inserted amino acid residue 
may or may not be one encoded by the genetic code. A variant of a polynucleotide or 
polypeptide may be a naturally occurring such as an allelic variant, or it may be a variant 
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that is not known to occur naturally. Non-naturally occurring variants of polynucleotides 
and polypeptides may be made by mutagenesis techniques or by direct synthesis. 



"Identity," as known in the art, is a relationship between two or more polypeptide sequences 
5 or two or more polynucleotide sequences, as determined by comparing the sequences. In the 
art, "identity" also means the degree of sequence relatedness between polypeptide or 
polynucleotide sequences, as the case may be, as determined by the match between 
strings of such sequences. "Identity" and "similarity" can be readily calculated by known 
methods, including but not limited to those described in (Computational Molecular 

10 Biology, Lesk, A.M., ed., Oxford University Press, New York, 1988; Biocomputing: 
Informatics and Genome Projects, Smith, D.W., ed., Academic Press, New York, 1993; 
Computer Analysis of Sequence Data, Part I, Griffin, A.M., and Griffin, H.G., eds., 
Humana Press, New Jersey, 1994; Sequence Analysis in Molecular Biology, von Heinje, 
G., Academic Press, 1987; and Sequence Analysis Primer, Gribskov, M. and Devereux, 

15 J., eds., M Stockton Press, New York, 1991; and Carillo, H., and Lipman, D., SLAM J, 
Applied Math., 48: 1073 (1988). Preferred methods to determine identity are designed to 
give the largest match between the sequences tested. Methods to determine identity and 
similarity are codified in publicly available computer programs. Preferred computer 
program methods to determine identity and similarity between two sequences include, but 

20 are not limited to, the GCG program package (Devereux, J., et al., Nucleic Acids 

Research 12(1): 387 (1984)), BLASTP, BLASTN, and FASTA (Atschul, S.F. et al., J. 
Molec. BioL 215: 403-410 (1990). The BLAST X program is publicly available from 
NCBI and other sources (BLAST Manual, Altschul, S., et al, NCBI NLM NIH Bethesda, 
MD 20894; Altschul, S„ et al, J. Mol. BioL 215: 403-410 (1990). The well known Smith 
25 Waterman algorithm may also be used to determine identity. 

The preferred algorithm used is FASTA. The preferred parameters for polypeptide or 
polynuleotide sequence comparison using this algorithm include the following: 
Gap Penalty: 12 
30 Gap extension penalty: 4 
Word size: 2, max 6 
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Preferred parameters for polypeptide sequence comparison with other methods include 
the following: 

1) Algorithm: Needleman and Wunsch, J. Mol Biol. 48: 443-453 (1970) 
Comparison matrix: BLOSSUM62 from HentikofF and Hentikoff, Proc. Natl. Acad. Sci. 
5 USA. 89:10915-10919 (1992) 
Gap Penalty: 12 
Gap Length Penalty: 4 

A program useful with these parameters is publicly available as the "gap" program from 
10 Genetics Computer Group, Madison WI. The aforementioned parameters are the default 
parameters for polypeptide comparisons (along with no penalty for end gaps). 

Preferred parameters for polynucleotide comparison include the following: 
1) Algorithm: Needleman and Wunsch, J. Mol Biol. 48: 443-453 (1970) 
15 Comparison matrix: matches = +10, mismatch = 0 
Gap Penalty: 50 
Gap Length Penalty: 3 

A program useful with these parameters is publicly available as the "gap" program from 
20 Genetics Computer Group, Madison WI. The aforementioned parameters are the default 
parameters for polynucleotide comparisons. 

By way of example, a polynucleotide sequence of the present invention may be identical 
to the reference sequence of SEQ ED NO:l, that is be 100% identical, or it may include 

25 up to a certain integer number of nucleotide alterations as compared to the reference 
sequence. Such alterations are selected from the group consisting of at least one 
nucleotide deletion, substitution, including transition and transversion, or insertion, and 
wherein said alterations may occur at the 5' or 3' terminal positions of the reference 
nucleotide sequence or anywhere between those terminal positions, interspersed either 

30 individually among the nucleotides in the reference sequence or in one or more 

contiguous groups within the reference sequence. The number of nucleotide alterations is 
determined by multiplying the total number of nucleotides in SEQ ID NO:l by the 
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numerical percent of the respective percent identity(divided by 100) and subtracting that 
product from said total number of nucleotides in SEQ ID NO:l, or: 

n n ^ * n - (x n • y), 

wherein n n is the number of nucleotide alterations, x n is the total number of nucleotides 
in SEQ ID NO:l, and y is, for instance, 0.70 for 70%, 0.80 for 80%, 0.85 for 85%, 0.90 
for 90%, 0.95 for 95%,etc, and wherein any non-integer product of x n and y is rounded 
down to the nearest integer prior to subtracting it from x n . Alterations of a 
polynucleotide sequence encoding the polypeptide of SEQ ED NO:2 OR 4 may create 
nonsense, missense or frameshift mutations in this coding sequence and thereby alter the 
polypeptide encoded by the polynucleotide following such alterations. 

Similarly, a polypeptide sequence of the present invention may be identical to the 
reference sequence of SEQ ID NO:2 OR 4, that is be 100% identical, or it may include up 
to a certain integer number of amino acid alterations as compared to the reference 
sequence such that the % identity is less than 100%. Such alterations are selected from 
the group consisting of at least one amino acid deletion, substitution, including 
conservative and non-conservative substitution, or insertion, and wherein said alterations 
may occur at the amino- or carboxy-tenninal positions of the reference polypeptide 
sequence or anywhere between those terminal positions, interspersed either individually 
among the amino acids in the reference sequence or in one or more contiguous groups 
within the reference sequence. The number of amino acid alterations for a given % 
identity is determined by multiplying the total number of amino acids in SEQ ID NO:2 
OR 4 by the numerical percent of the respective percent identity(divided by 100) and then 
subtracting that product from said total number of amino acids in SEQ ID NO:2 OR 4, or: 

n a <x a - (x a • y), 

wherein n a is the number of amino acid alterations, x a is the total number of amino acids 
in SEQ ID NO:2 OR 4, and y is, for instance 0.70 for 70%, 0.80 for 80%, 0.85 for 85% 
etc., and wherein any non-integer product of x a and y is rounded down to the nearest 
integer prior to subtracting it from x a . 
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'Homolog" is a generic term used in the art to indicate a polynucleotide or polypeptide 
sequence possessing a high degree of sequence relatedness to a subject sequence. Such 
relatedness may be quntified by determining the degree of identity and/or similarity between 
the sequences being compared as hereinbefore described. Falling within this generic term 
are the terms "ortholog", meaning a polynucleotide or polypeptide that is the functional 
equivalent of a polynucleotide or polypeptide in another species and "paralog" meaning a 
functionally similar sequence when considered within the same species. 



Examples 
Example 1 

Subtractive cDNA cloning of colon tumour-associated antigen (TAA) candidates. 

Subtractive cDNA libraries are produced using standard technologies. Briefly, total 
RNA is extracted from frozen (-70°C) tumour and matched normal colon samples using the 
TriPure reagent and protocol (Boehringer). Target RNA is prepared by pooling total RNA 
from three tumour samples (30 jig each). Driver RNA is prepared by pooling total RNA 
from three matched normal colon samples (10 jig each) and total RNA from seven normal 
tissues other than colon(brain, heart, kidney, liver, bladder, skin, spleen; 1 0 ng each). Total 
RNA from non-colon normal tissues is purchased from InVitrogen. 

Messenger RNA is purified from total RNA using oligo-dT magnetic bead 
technology (Dynal) and quantified by spectrofluorimetry (BioRad). 

Target and driver mRNA are reverse transcribed into cDNA using one of two 
strategies: 1) Target sequences for PGR oligonucleotides are introduced onto the ends of the 
newly synthesised cDNA during reverse transcription using the template switching 
capability of reverse transcriptase (ClonTech SMART PCR cDNA synthesis kit). 2) 
Alternatively, the target and driver mRNA are reverse transcribed into cDNA using an 
oligo-dT primer and converted to double-strand cDNA; the cDNA is cleaved with Rsal and 
linkers for PCR amplification are ligated onto the extremities of the cDNA fragments. 

In both cases, target and driver cDNA are amplified by long range PCR (ClonTech 
SMART PCR Synthesis Kit and Advantage PCR Polymerase Mix) and used as starting 
material for subtractive cloning. For amplification, cycling conditions and optimisation of 
the number of PCR cycles are as described in the Advantage PCR protocol. 
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Two subtractive cloning strategies are used: ClonTech PCR SELECT (see 
ClonTech kit protocol and N. Gurskaya et aL 1996. Analytical Biochemistry: 240, 90) and 
cRDA(M. Hubank and D. Schatz. 1994. Nucleic Acids Research: 22, 5640) . When the PCR 
SELECT protocol is used, the primary PCR SELECT subtraction products are submitted to 
a supplementary round of cRDA subtraction. When the cRDA protocol is used, two 
consecutive cycles of cRDA subtraction are performed. In each case the products of both 
cycles of subtraction are cloned into pCR-TOPO (InVitrogen) and transformed into E. coli 
to produce a subtracted cDNA plasmid library. 

An alternative strategy is also followed: subtraction of nonnal colon sequences and 
sequences from non-colon normal tissues are subtracted in separate hybridizations. In this 
case, target and driver RNA are assembled for the first subtraction as above with the 
exception that non-colon RNA is left out of the driver pool and amounts of normal colon are 
increased to 10 ^ig. Preparation of target and driver cDNA and subtractive hybridization are 
performed as described above. A second subtraction is then performed on the products of the 
first subtraction, but the driver is now composed of a pool of normal colon and normal non- 
colon mRNA from the seven normal tissues. 

Example 2 

Differential Screening of cDNA arrays. 

Identification of tumour-associated genes in the subtracted cDNA library is accomplished by 
differential screening. 

Total bacterial DNA is extracted from 100 fil over-night cultures. Bacteria are lysed with 
guanidium isothiocyantate and the bacterial DNA is affinity purified using magnetic glass 
(Boehringer). Plasmid inserts are recovered from the bacterial DNA by Advantage PCR 
amplification (Clontech). The PCR products are dotted onto two nylon membranes to 
produce high density cDNA arrays using the Biomek 96 HDRT tool (Beekman). The 
spotted cDNA is covalently linked to the membrane by UV irradiation. The first membrane 
is hybridised with a mixed cDNA probe prepared from the tumour of a single patient The 
second membrane is hybridised with an equivalent amount of mixed cDNA probe prepared 
from normal colon of the same patient. The probe cDNA is prepared by PCR amplification 
as described above and is labelled using the AlkPhos Direct System (Amersham). 
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Hybridisation conditions and stringency washes are as described in the AlkPhos Direct kit 
Hybridized probe is detected by chemiluminescence. Hybridisation intensities for each 
cDNA fragment on both blots are measured by film densitometry or direct measurement 
(BioRad Fluor-S Max). The ratio of the tumour to normal hybridisation intensities (T/N) is 
5 calculated for each gene to evaluate the degree of over-expression in the tumour. Genes 
which are significantly over-expressed in colon tumours are followed-up. Significance is 
arbitrarily defined as one standard deviation of the T/N frequency distribution. Differential 
screening experiments are repeated using RNA from multiple patient donors (>18) to 
estimate the frequency of over-expressing tumours in the patient population. 
1 0 In addition, the DNA arrays are hybridised with mixed cDNA probes from normal tissues 
other than colon (see list above) to determine the level of expression of the candidate gene in 
these tissues. 

Example 3 
15 Real-time RT-PCR analysis 

Real-time RT-PCR (U. Gibson. 1996. Genome Research: 6,996) is used to compare 
mRNA transcript abundance of the candidate antigen in matched tumour and normal 
colon tissues from multiple patients. In addition, mRNA levels of the candidate gene in a 
panel of normal tissues are also evaluated by this approach. 

20 

Total RNA from normal and tumour colon is extracted from snap frozen biopsies using 
TriPure reagent (Boehringer). Total RNA from normal tissues is purchased from 
InVitrogen or is extracted from snap frozen biopsies using TriPure reagent. Poly-A+ 
mRNA is purified from total RNA after DNAase treatment using oligo-dT magnetic 
25 beads (Dynal). Quantification of the mRNA is performed by spectrofluorimetry 

(V ersaFluor, BioRad) using Sybrll dye (Molecular Probes). Primers for real-time PCR 
amplification are designed with the Perkin-Elmer Primer Express software using default 
options for TaqMan amplification conditions. 

30 Real-time reactions are assembled according to standard PCR protocols using 2 ng of 
purified mRNA for each reaction. SybrI dye (Molecular Probes) is added at a final 
dilution of 1/75000 for real-time detection. Amplification (40 cycles) and real-time 
detection is performed in a Perkin-Elmer Biosystems PE7700 system using conventional 
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instrument settings. Ct values are calculated using the PE7700 Sequence Detector 
software. Several Ct values are obtained for each samples : for the patient samples, the 
tumour Ct (CtT) and the matched normal colon Ct (CtN) values on the candidate TAA, 
and for the panel of normal tissue samples, a CtXY for each normal tissue XY. An 
another Ct (CtA) is also calculated on Actin gene, as an internal reference, for all of the 
samples. 

As the efficiency of PCR amplification under the prevailing experimental conditions is 
close to the theoretical amplification efficiency, 2 (CtNn7XY ~ CtA) value is an estimate of the 
relative TAA transcript level of the sample, standardised with respect to Actin transcript 
level. A value of 1 thus suggests the candidate antigen and Actin have the same 
expression level. 

Real-time PCR reactions were performed on tumour colon and adjacent normal colon 
from biopsies of 18 patients. Results are shown in figure 1. 36 normal tissue samples, 
representing 28 different tissues (see table 2), were also tested by the same procedure. 
Results are shown in figure 2. 

TAA transcript levels are calculated as described above. The proportion of patients over- 
expressing the candidate antigen, as well as the average transcript over-expression versus 
normal tissues is also calculated from this data set. 
Overall results are shown in Table 1 : 



Table 1 : CASB6411 Real-time PCR expression results 



% of patients with a CASB6411 transcript level higher in tumour colon than 
adjacent normal colon (positive patients) 


83% 


% of positive patients with a CASB6411 transcript level at least 3 fold higher 
CASB641 1 transcript than adjacent normal colon 


60% 


% of positive patients with a CASB641 1 transcript level at least 10 fold higher in 
tumour colon than adjacent normal colon 


40% 


Average transcript over-expression fold in positive patients 


40 


% of patients with a CASB6411 transcript level higher in tumour colon than 
normal tissue median 


100% 


% of patients with a mRNA level at least 3 fold higher in tumour colon than 
normal tissue median 


100% 
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% of patients with a mRNA level at least 10 fold higher in tumour colorTthan 
normal tissue median 

Normal tissues where CASB64 1 1 transcript expression is equivalent than tumouF 
transcript level in tumours 



Lung, 
prostate 



Table 1 clearly suggest CASB641 1 transcript is over-expressed in colorectal tumours 
compared to adjacent normal colon and to most of the above mentioned normal tissues. 
More than 80% of the patients strongly over-express CASB641 1 transcript in tumour, as 
compared to adjacent normal colon. Average over-expression fold is at least of 40, with 
40 % of patients having overexpressing at least 10 fold. More over, all of the patients 
over-express CASB641 1 transcript in colorectal tumors, as compared to other normal 
tissues, 40 % of them overexpressing it at least 10 fold. 

Table 2 : listing of normal tissues used for CASB7439 transcript expression analysis. 



Tissue 
lAdrenal gland 



Abbreviation 



Categorie 



Ad Gl 



non dispensable 



Aorta 
Bladder 



Ao 

IT 



I non dispensable 
non dispensable 



[Bone m arrow 
! Brain — 



Bo Ma 



non dispensable 



Cervix 



Bra 



Ce 



non dispensable 



non dispensable 



[Colon 
| Fallopian tube 



Lung 
Lymph node 



Oesophagus 
Parathyr oid gland 
Rectum 



Skin 
Skeletal muscle 
Small intestine 



[Spleen 
Stomach" 
Thyroid g land 
Trachea 



Co 
Fa Tu 



He 
II 



Ki 



LU 

_y__No 



Qe 

p a_Thy 
Re 



[Ovary 
Placenta 
[ Prostat e 
Testis 



Sk 
Sk_Mu 
Sm In 



Sp 

Thy 
Tra 



Ov 



Pl_ 
Pr 



I non dispensable 
non dispensable 



|non dispensable 
non dispensable 



|non dispensable 
non dispensable 



|non dispensable 
non dispensable 



I non dispensable 
[non dispensable 
non dispensable 



non dispensable 
[non dispensable 
\non dispensable 



non dispensable 
non dispensable 
non dispensable 
non dispensable 



dispensable 



dispensable 
dispensable 



re 



dispensable 
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Example 4 

DNA microarrays 

DNA micro-arrays are used to examine mRNA expression profiles of large collections of 
genes in multiple samples. This information is used to complement the data obtained by 
real-time PCR and provides an independent measure of gene expression levels in tumors 
and normal tissues. 

Examples of current technologies for production of DNA micro-arrays include 1) The 
Affymetrix "GeneChip" arrays in which oligonucleotides are synthetized on the surface 
of the chip by solid phase chemical synthesis using a photolithographic process 2) DNA 
spotting technology in which small volumes of a DNA solution are robotically deposited 
and then immobilized onto the surface of a solid phase (e.g. glass). In both instances, the 
chips are hybridized with cDNA or cRNA which has been extracted from the tissue of 
interest (e.g. normal tissue, tumour etc.) and labeled with radioactivity or with a 
fluorescent reporter molecule. The labeled material is hybridized to the chip and the 
amount of probe bound to each sequence on the chip is determined using a specialized 
scanner. The experiment can be set-up with a single fluorescent reporter (or radioactivity) 
or, alternatively, can be performed using two fluorescent reporters. In this latter case, 
each of the two samples is labeled with one of the reporter molecules. The two labeled 
samples are then hybridized competitively to the sequences on the DNA chip. The ratio 
of the two fluorescent signals is determined for each sequence on the chip. This ratio is 
used to calculate the relative abundance of the transcript in the two samples. Detailed 
protocols are available from a number of sources including "DNA Microarrays: A 
practical approach. Schena M. Oxford University Press 1999" and the World Wide Web 
(http://cmgm.stanford.edu/pbrown/protocols/index.htail), http://arrayit.com/DNA- 
Microarray-Protocols/) and specialized distributors (e.g. Affymetrix). 

Example 5 

Northern-Southern blot analysis 

Limited amounts of mixed tumour and matched normal colon cDNA are amplified by 
Advantage PCR (see above). Messenger RNA from multiple normal tissues is also 
amplified using the same procedure. The amplified cDNA (1 fig) is electrophoresed on a 
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1.2% agarose gel and transferred onto a nylon membrane. The membrane is hybridised 
(AlkPhos Direct System) with a probe prepared using a fragment of the candidate TAA 
cDNA. Northern-Southern analysis provides information on transcript size, presence of 
splice variants and transcript abundance in tumour and normal tissues. 



Example 6 

Northern Blot Analysis 

Northern blots are produced according to standard protocols using 1 fig of poly A+ 
mRNA. Radioactive probes are prepared using the Ready-to-Go system (Pharmacia). 

Example 7 

In silico detection of the full length cDNA sequence 

EST sequence databases are screened with experimentally obtained cDNA sequence 
fragments, using the Blast algorithm (Altschul, S.F., Gish, W., Miller, W., Myers, E.W. 
& Lipman, DJ. (1990) "Basic local alignment search tool." J. Mol. Biol. 215:403-41). 
The aim is to search for overlapping or longer identical EST sequences. Matched EST 
sequences are then assembled together, using the SeqMan software of the Lasergene 
package (DNASTAR). The consensus sequence of the resulting assembly is an EST- 
derived longer cDNA. This EST-derived cDNA is further analysed using the GeneMark 
software to find a potential open reading frame (ORF). The translated sequence of the 
ORF is compared with protein databases, using the Blast algorithm, to find homologues. 
If any, the homologous protein sequences are further used to complete the cDNA 
prediction by searching for genomic contig homologies using the Wise2 algorithm, 
leading to a genome-derived, virtual cDNA sequence. This virtual cDNA in finally 
assembled with EST-derived cDNA, and the new consensus cDNA undergoes a final 
check against ESTs to confirm the Wise2 prediction, and correct potential sequencing 
errors and frameshifts. The virtual cDNA is considered as a virtual full length cDNA 
once a full ORF (from start to stop codons), with clear protein homologies and coding 
potential. 

The SEQ ID NO:l has been obtained using this in silico cloning method of the fiill length 
cDNA sequence of CASB641 1, which has a putative open reading frame of 460 amino 
acids (SEQ ED NO:2), and a potential isoform generated by alternative splicing (SEQ ID 
NO:3), encoding a truncated protein (SEQ ID NO:4). 
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Virtual full-length cDNA sequence is experimentally checked as described below. 
Example 8 

Experimental Identification of the full length cDNA sequence 

Colon tumour cDNA libraries are constructed using the Lambda Zap II system 
(Stratagene) from 5 \ig of polyA+ mRNA. The supplied protocol is followed except that 
Superscript]! (Life Technologies) is used for the reverse transcription step. Oligo dT- 
primed and random-primed libraries are constructed. About 1.5 x 106 independent phages 
are plated for each screening of the library. Phage plaques are transferred onto nylon 
filters and hybridised using a cDNA probe labelled with AlkPhos Direct. Positive phages 
are detected by chemiluminescence. Positive phage are excised from the agar plat, eluted 
in 500\i\ SM buffer and confirmed by gene-specific PCR. Eluted phages are converted to 
single strand Ml 3 bacteriophage by in vivo excision. The bacteriophage is then converted 
to double strand plasmid DNA by infection of E. coli. Infected bacteria are plated and 
submitted to a second round of screening with the cDNA probe. Plasmid DNA is purified 
from positive bacterial clones and sequenced on both strands. 



When the full length gene cannot be obtained directly from the cDNA library, missing 
sequence is isolated using RACE technology (Marathon Kit, ClonTech.). This approach 
relies on reverse transcribing mRNA into double strand cDNA, ligating linkers onto the 
ends of the cDNA and amplifying the desired extremity of the cDNA using a gene- 
specific primer and one of the linker oligonucleo tides. Marathon PCR products are cloned 
into a plasmid (pCRH-TOPO, InVitrogen) and sequenced. 



Example 9. 
EST profiles 

A complementary approach to experimental antigen tissue expression characterization is 
to explore the human EST database. ESTs ('Expressed Sequence Tags ) are small 
fragments of cDNA made from a collection of mRNA extracted from a particular tissue 
or cell line. Such database currently provides a massive amount of human ESTs (10 6 ) 
from several hundreds of cDNA tissue libraries, including tumoral tissues from various 
types and states of disease. By means of informatics tools (Blast), a comparison search of 
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the CASB641 1 sequence is performed in order to have further insight into tissue 
expression. 



EST distribution of CASB6411 : 



EST GenBank Accession number 


EST cDNA tissue library 


C00562 


Human adult (K.Okubo) 


C05837 


Human pancreatic islet 


AA1 72076 


Stratagene ovarian cancer (#937219) 


AA1 72244 


Stratagene ovarian cancer (#937219) 


AA612697 


NCLCGAP_Co10 


AA371314 


Prostate gland I 


AI264367 


NCI CGAP Co8 


AI270207 


NCLCGAP_Co14 


A1278830 


NCLCGAP_Co8 


AJ281230 


NC l_CGAP_Co8 


AI283827 


NCI_CGAP_Co8 


AI285194 


NCI CGAP Co8 


AI285227 


NCLCGAP_Co8 


AI346622 


NCLCGAP_Co8 


A1362363 


NCI CGAP Gas4 


AI473464 


NCI CGAP Gas4 


AI697014 


NCI_CGAP_Pan1 


AI799626 


NCLCGAP_Gas4 


AI830044 


NCI CGAP Lu19 


A1921465 


NCLCGAP_Gas4 


AI925050 


NCI_CGAP_Gas4 


AW029127 


NCI_CGAPJ3as4 


AW365013 


DT0057 


AW452356 


NCI_CGAP_Sub5 


AW469177 


NCI CGAP Gas4 


AW469181 


NCLCGAP_Gas4 


AW582253 


ST0212 


AW8 10203 


ST0125 


AW8 10268 


ST0125 


AW810418 


ST0125 


AW8 14058 


ST0198 


AW869793 


SN0075 



SEQ ID NO:l perfectly aligns with 32 ESTs : 13 are from 2 stomach cDNA libraries, 9 
are from 3 tumor colon libraries, 2 are from one tumor ovary library, one is from one 
tumor pancreas library, one is from one lung tumor library, one is from one mixed tumor 
10 library, one is from one normal prostate library, one is from one normal stomach library, 
one is from normal pancreas library, 2 are from libraries of unknown type. 
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This clearly suggests, as expected, that CASB641 1 is over-expressed in tumor tissues, 
with an emphasis in colorectal and stomach tumor tissues, as compared to normal 
tissues. 

Example 10 : 

10.1 Expression and purification of tumour-specific antigens 

Expression in microbial hosts, or alternatively in vitro transcription/translation, is used to 
produce the antigen of the invention for vaccine purposes and to produce protein 
fragments or whole protein for rapid purification and generation of antibodies needed for 
characterization of the naturally expressed protein by immunohistochemistry or for 
follow-up of purification. 

Recombinant proteins may be expressed in two microbial hosts, E. coli and in yeast 
(such as Saccharomyces cerevisiae or Pichia pastoris). This allows the selection of the 
expression system with the best features for this particular antigen production. In 
general, the recombinant antigen will be expressed in E. coli and the reagent protein 
expressed in yeast. 

The expression strategy first involves the design of the primary structure of the 
recombinant antigen. In general an expression fusion partner (EFP) is placed at the N 
terminal extremity to improve levels of expression that could also include a region useful 
for modulating the immunogenic properties of the antigen, an immune fusion partner 
(IFP). In addition, an affinity fusion partner (AFP) useful for facilitating further 
purification is included at the C-terminal end. 

When the recombinant strains are available, the recombinant product is characterized by 
the evaluation of the level of expression and the prediction of further solubility of the 
protein by analysis of the behavior in the crude extract. 

After growth on appropriate culture medium and induction of the recombinant protein 
expression, total extracts are analyzed by SDS-PAGE. The recombinant proteins are 
visualized in stained gels and identified by Western blot analysis using specific 
antibodies. 

A comparative evaluation of the different versions of the expressed antigen will allow the 
selection of the most promising candidate that is to be used for further purification and 
immunological evaluation. 
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The purification scheme follows a classical approach based on the presence of an His 
affinity tail in the recombinant protein. In a typical experiment the disrupted ceils are 
filtered and the acellular extracts loaded onto an Ion Metal Affinity Chromatography 
5 (IMAC; Ni+-*-NTA from Qiagen) that will specifically retain the recombinant protein. 

The retained proteins are eluted by 0-500 mM Imidazole gradient (possibly in presence of 
a detergent) in a phosphate buffer. 

10.2 Antibody production and immun ©histochemistry 

10 Small amounts of relatively purified protein can be used to generate immunological 
tools in order to 

a) detect the expression by immunohistochemistry in normal or cancer tissue sections; 

b) detect the expression, and to follow the protein dining the purification process 
(ELISA/ Western Blot); or 

15 c) characterise/ quantify the purified protein (ELISA). 

1 0.2. 1 Polyclonal antibodies: 
Immunization 

Rabbits are immunised , intramuscularly (I.M.) , 3 times at 3 weeks intervals with 
20 100p.g of protein, formulated in the adjuvant 3D-MPL/QS2L Three weeks after each 
immunisation a blood sample is taken and the antibody titer estimated in the serum by 
ELISA using the protein as coating antigen following a standard protocol. 

ELISA 

25 96 well microplates (maxisorb Nunc) are coated with 5p,g of protein overnight at 4°C. 
After lhour saturation at 37°C with PBS NCS 1%, serial dilution of the rabbit sera is 
added for 1H 30 at 37°C (starting at 1/10). After 3 washings in PBS Tween, anti rabbit 
biotinylated anti serum (Amersham ) is added (1/5000). Plates are washed and 
peroxydase coupled streptavidin (1/5000) is added for 30 min at 37°C. After washing, 

30 50^1 TMB (BioRad) is added for 7 min and the reaction then stopped with H2S04 0.2M. 
The OD can be measured at 450 nm and midpoint dilutions calculated by SoftmaxPro. 

10.2.2 Monoclonal antibodies : 
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Immunization 

5 BALB/c mice are immunized 3 times at 3 week intervals with 5 fig of purified protein. 
Bleedings are performed 14 days post II and 1 week post 3. The sera are tested by Elisa 
on purified protein used as coated antigen. Based on these results (midpoint dilution > 

< 

1 0000 ) one mouse is selected for fusion 
Fusion/ HATselection 

Spleen cells are fused with the SP2/0 myeloma according to a standard protocol using 
PEG 40% and DMSO 5%. Cells are then seeded in 96 well plates 2.5 xl04 - 105 
cells/well and resistant clones will be selected in HAT medium. The supernatant of these 
hybridomas will be tested for their content in specific antibodies and when positive, 
will be submitted to 2 cycles of limited dilution . After 2 rounds of screening, 3 
hybridomas will be chosen for ascitis production. 

10.2.3 Immunohistochemistry 

When antibodies are available, immuno staining is performed on normal or cancer tissue 
sections, in order to determine : 

0 the level of expression of the antigen of the invention in cancer relative to normal 
tissue or 

0 the proportion of cancer of a certain type expressing the antigen 

0 if other cancer types also express the antigen 

0 the proportion of cells expressing the antigen in a cancer tissue 



Tissue sample preparation 

After dissection, the tissue sample is mounted on a cork disk in OCT compound and 
rapidly frozen in isopentane previously super cooled in liquid nitrogen (-160°C). The 
block will then be conserved at -70°C until use. 7-10jim sections will be realised in a 
cryostat chamber (-20, -30°C). 

Staining 

Tissue sections are dried for 5 min at room Temperature (RT), fixed in acetone for 

lOmin at RT, dried again, and saturated with PBS 0.5% BSA 5% serum. After 30 min at 
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RT either a direct or indirect staining is performed using antigen specific antibodies. A 
direct staining leads to a better specificity but a less intense staining whilst an indirect 
staining leads to a more intense but less specific staining. 

5 10.3 Analysis of human cellular immune responses to the antigen of the invention 

The immunological relevance of the antigen of the invention can be assessed by in vitro 
priming of human T cells. All T cell lymphocyte lines and dendritic cells are derived 
from PBMCs (peripheral blood mononuclear cells) of healthy donors (preferred HLA-A2 
10 subtype). An HLA-A2.1/Kb transgenic mouse model is also used for screening of HLA- 
A2.1 peptides. 

Newly discovered antigen-specific CD 8+ T cell lines are raised and maintained by 
weekly in vitro stimulation. The lytic activity and the d-IFN production of the CD8 lines 
15 in response to the antigen or antigen derived-pep tides is tested using standard procedures. 

Two strategies to raise the CD8+ T cell lines are used: a peptide-based approach and a 
whole gene-based approach. Both approaches require the full-length cDNA of the newly 
discovered antigen in the correct reading frame to be either cloned in an appropriate 
20 delivery system or to be used to predict the sequence of HLA binding peptides. 

Peptide-based approach 

Briefly, transgenic mice are immunized with adjuvanted HLA-A2 peptides, those unable 
to induce a CD8 response (as defined by an efficient lysis of peptide-pulsed autologous 

25 spleen cells) will be further analyzed in the human system. 

Human dendritic cells (cultured according to Romani et al.) will be pulsed with peptides 
and used to stimulate CD8-sorted T cells (by Facs). After several weekly stimulations, the 
CD8 lines will be first tested on peptide-pulsed autologous BLCL (EBV-B transformed 
cell lines). To verify the proper in vivo processing of the peptide, the CD8 lines will be 

30 tested on cDNA-transfected tumour cells (HLA-A2 transfected LnCaP, Skov3 or CAMA 
tumour cells). 

Whole gene-based approach 
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CD8+ T cell lines will be primed and stimulated with either gene-gun transfected 
dendritic cells, retrovirally transduced B7.1 -transfected fibroblasts, recombinant pox 
virus (Kim et al.) or adenovirus (Butterfield et al.) infected dendritic cells. Virus infected 
cells are very efficient to present antigenic peptides since the antigen is expressed at high 
5 level but can only be used once to avoid the over-growth of viral T cells lines. 

After alternated stimulations, the CD8+ lines are tested on cDNA-transfected tumour 
cells as indicated above. Peptide specificity and identity is determined to confirm the 
immunological validation. 

10 

CD4+ T-cell response 

Similarly, the CD4+ T-cell immune response can also be assessed. Generation of specific 
CD4+ T-cells is made using dendritic cells loaded with recombinant purified protein or 
15 peptides to stimulate the T-cells. 

Predicted epitopes (nonamers and decamers) binding HLA alleles : 

The HLA Class I binding peptide sequences are predicted either by the Parker's 

algorithm (Parker, K. C, M. A. Bednarek, and J. E. Coligan. 1994. Scheme for ranking 

20 potential HLA-A2 binding peptides based on independent binding of individual peptide 
side-chains. J. Immunol. 152:163 and http://bimas.dcrtnih.gov/molbio/hla_bind/) or the 
Rammensee method (Rammensee, Friede, Stevanovic, MHC ligands and peptide motifs: 
1st listing, Immunogenetics 41, 178-228, 1995 ; Rammensee, Bachmann, Stevanovic: 
MHC ligands and peptide motifs. Landes Bioscience 1997, and 

25 http://134.2.96.221/scripts/hlaserver.dll/home.htm). Peptides are then screened in the 
HLA-A2.1/Kb transgenic mice model (Vitiello et al.). 

The HLA Class II binding peptide sequences are predicted using the Tepitope algorithm, 
with a score cut-off set to 6 (Sturniolo, Hammer at al., Nature Biotechnology. 1999. 
17;555-561). 

30 The following tables gather the Class I and II predicted epitope sequences : 



HLA-Al ; decamers 


Rank 


Start Position 


Subsequence Residue Listing 


Parker's Score * 


SEQED: 


1 


379 


ITEGRKIMIR 


1 12.500 


SEQ ID NO:9 
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HLA-A0201 



[Rank 


Start Position 


Subsequence Residue Listing 


1 1 


191 


LLMDFVFSL 


1 2 


292 


FLLFFPSFT 


I 


285 


QMMTFFIFL 


4 


286 


MMTFFIFLL 




403 
147 


FLIEKLIKL 1 


7 


240 


VLLIRNIFL 
LVWIGIFFC 


8 

9 j 


305 
22 


TLAITIWRL 


10 


203 j 


LIFCWDFTV j 
FLGEFLRRI j 


1 1 


112 


LLLPFWSC J 


12 


66 


LLTRFSAYM | 


13 


21 


_ KLIFCWDFT " | 



nonamers 

Pa rker's Scor e 

27926.980 
2650.811 
2266.849 



1329.564 
926.658 
739.032 
450.023 
368.501 
348.892 
343.941 
273.114 
176.513 
119.495 



SEQ ED : 
SEQ ID NO: 1 Q 
SEQ ID NO: 1 1 



SEQ ID NO: 12 



SEQ ID NO: 13 
SEQ ID NO: 14 
SEQ ID NO: 1 5 
SEQ ID NO: 1 6 
SEQ ID NO: 1 7 
SEQ ID NO: 18 



SEQ ID NO: 19 
SEQ ID NO:20 
SEQ ID NO:2 1 



SEQ ID NO:22 



HLA-A0201 



| Rank 


Start Position 


_ Subsequence Residue Listing 


} 1 


" 190 


LLLMDFVFSL 


j 2 


191 


LLMDFVFSLV 


j 3 


285 


1 QMMTFFIFLL 


j 4 


93 


j YLAEYNLEFL 


j 5 


[ 21 


1 KLIFCWDFTV 


j 6 


[ 293 


1 LLFFPSFTGV 


} 7 


[ 65 


1 QLLTRFSAYM 


1 8 


1 239 


1 TLVWIGIFFC 


| 9 | 


284 


1 SQMMTFFIFL 


1 10 j 


111 | 


[ VLLLPFWSC 


j 11 j 


162 | 


ILCYYWLNTV 


[ 12 | 


366 ~j 


TLIVLIITYL 


[ 13 1 


146 j 


YVLLIRNIFL 


14 j 


195 | 


FVFSLVNSFL 


15 j 


112 j 


LLLPFWSCI 


16 j 

17 } 

18 [ 


355 { 


LIGSVHFFFI 


138 | 
66 1 


YEMPRHEVYV 
LLTRFSAYMV 



: decamers 

Parker's Score * 
6811.458 
3407.985 
2893.757 
2497344 
1411.906 
831.216 
384.175 
364.502 
318.231 
273.114 
271.948 
270.234 
174.977 
174.977 
150.931 
147.808 
126.598 
118.238 



SEQ ID : 



SEQ ID NO:23 
SEQ ID NO:24 
SEQ ID NO:25 
SEQ ID NO:26 
SEQ ID NO:27 
SEQ ID NOl2S 
SEQ ID NO;29 
SEQ ID NO:30 
SEQ ID NQ:3 1 
SEQ ID NO:32 
SEQ ID NO:33 
SEQ ID NO:34 
SEQ ID NO:35 
SEQ ID NO:36 
SEQIDNO:37 
SEQ ID NO:38 
SEQ ID NO:39 
SEQ ID NO:40 




Rank 

T 

2 



Start Position 
191 

403 



Subsequence Residue Listing 
LLMDFVFSL 
FLIEKLIKL 



Parker's Score * 
514.080" 
126.000 



SEQ ID : 

SEQ ID NO: 10 
SEQ ID NO: 14 
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HLA A205 : decamers 



Rank 


Start Position 


Subsequence Residue Listing 


Parker's Score * 


SEQ ID 


1 


190 


LLLMDFVFSL 


171.360 


SEQ ID NO:23 


2 


146 


YVLLIRNIFL 


126.000 


SEQ ID NO:35 


3 


195 


FVFSLVNSFL 


126.000 


SEQ ID NO:36 


4 


285 


QMMTFFIFLL 


100.800 


SEQ ED NO:25 




HLA A24 : nonamers 


Rank 


Start Position 


Subsequence Residue Listing 


Parker's Score * 


SEQ ED : 


1 


14 


I YSGGITKL 


220.000 


SEQ ID NO:41 


2 


91 


VYYLAEYNL 


200.000 


SEQ ID NO:42 


3 


165 


YYWLNTVAL 


200.000 


SEQ ID NO:43 


4 


187 


IYRLLLMDF 


120.000 


SEQ ID NO:44 




HLA A24 : decamers 


Rank 


Start Position 


Subsequence Residue Listing 


Parker's Score 


SEQ ID : 


1 


145 


VYVLLIRNIF 


252.000 


SEQ ED NO:45 


2 


164 


CYYWLNTVAL 


200.000 


SEQ ID NO:46 


3 


92 


YYLAEYNLEF 


165.000 


SEQ ED NO:47 




HLA A3 : nonamers 


Rank 


Start Position 


Subsequence Residue Listing 


Parler's Score * 


SEQ ID : 


1 


258 


MLFIMFYSK 


900.000 


SEQ ID NO:48 


2 


148 


LLIRNIFLK 


135.000 


SEQ ID NO:49 


3 


221 


GLQEFDIAR 


108.000 


SEQ ID NO:50 


• : Estimate of Half Time of Disassociation of a Molecule Containing This Subsequence. 


HLA A3 : decamers 


Rank 


Start Position 


Subsequence Residue Listing 


Parker's Score * 


SEQ ID : 


1 


401 


KMFLIEKLIK 


600.000 


SEQIDNO:51 


2 


257 


IMLFIMFYSK 


270.000 


SEQ ID NO:52 


3 


407 


KLIKLQDMEK 


180.000 


SEQ ID NO:53 


4 


147 


VLLIRNIFLK 


135.000 


SEQ ID NO:54 








HLA B7 : nonamers 


Rank 


Start Position 


Subsequence Residue Listing 


Parker's Score * 


SEQ ID : 


1 


140 


MPRHEVYVL 


800.000 


SEQ ID NO:55 








HLA B7 : decamers 


Rank 


jStart Position | Subsequence Residue Listing | Parker's Score * | SEQ ID : 
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Rank 


Start Position 


1 


138 ~ 



HLA B4403 nonamcrs 



Subsequence Residue Listing 

YEMPRHEVY 



Parker's Score * 

480.000 



SEQD>: 

SEQ ID NO:57 




Estimate of half time of disassociation of a molecule containing this subsequence. 



Rank 
I 

2 



Start Position 

363 
210 



HLA-DRB1*1501 
Subsequence Residu e Listing 

VHFFFILTL 
LRRIIGMQL 



nonamers 

Tepitope ScorT 

6.2 



SEQ ID : 

SEQ ID NQ:59 
SEQ ID NO:60 



R ank 1 Start Pos ition 
1 1 193 



HLA-DRB1*Q301 : nonamers 



Subsequence Resi due Listing 

LLMDFVFSL 



Tep itope Score 



SEQ ID 

SEQIDNO:61 
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SEQUENCE INFORMATION 
SEQ ID NO:l 

TGGGGAGGCAGAAGGCAGACTGATCACTTGAGGCCAGGAGTTTGAGACCTCAT 
5 GTCTAAAAAAAAAAAATTCTGTGAGGTGAGTTTTAT^ 

ATATGGAAACTGAGGCTGAGAATCAGAACCATTCACAAGACAAAAATCCCCCAG 
TTGGCAGATCCAGGGTTGCAAGCCAGGCCTGTGCAGCCCCAAAACCAGTGCTTG 
TTTAACCACTGTGTGGTGACCACACCGCTCCAGGCCAACAGCTTGGGGCTAAGT 
CTTCACGTTGCCTTTCACCATTAAATAATAGGGCTGCCCTTTGTTGAAGCCC 
1 0 ACTCCC AGTG ACGGCCAT AAT AACCTTCAGGTGTTCTGCTTTCTGCCTTCTCT AG 
Catggccaagtatttccggaacaacttcattaatccccacatttactccgg 

tcactcatgaaaaagctgtgaagctaaaacagaagaatcttagcactgagataagggagaacctgtcagagctccgtcaggagaa 
ttccaagttgacgttcaatcagctgctgacccgcttctctgcc^ 

tgcagccgtttattacctggctgagtacaacttagagttcctgaagacacacagtaaccctggggcggtgctgttactgcctttcgttgt 
15 gtcctgcattaatctggccgtgccatgcatctactccatgttcaggcttgtggagaggtacgagatgccacggcacgaagtctacgttc 
tccrtgatccgaaacatcttmgaaaatatcaatcattggcattcmgttactatt^ 

ttgggaaaccctcattggccaggacatctaccggctccttctgatgga 

ggagaatc^ttgggatgcaartgatcacaagtcttggccttcaggagtttgacattgccaggaacgttctagaactgatctatgcaca 
aactctggtgtggattggcatcttcttctgccccctgctgccc 
20 gcctgatgatgaatttccagcctccgagcaaagcctggcgggcctcacagatgatgac^ 

caccggggtcttgtgcaccctggccatcaccatctggagattgaagccttcagctgactgtggcccttttcgaggtctgcctctcttcat 

tcactccatctacagctggatcgacaccctaagtacacggcctggc^ 

tgcacttctttttcatcctcaccctcattgtgctaatcatcacc^ 

tgctccatgagcagatcattaatgagggcaaagataaaatgttcctgatagaaaaattgatcaagctgcaggatatggagaagaaa 

25 gcaaaccccagctcacttgttrtggaaaggagagaggtggagcaacaaggm 

ttgcgatctagaagatcagttcaagaaggtaatccaagggcctgaTGACTCrriTGGTAACCAGACACCAA 
TCAAATAAGGGGAGGAGACGAAAATGGAATGATTTCITCCATGCCACCTGTGCC 
TTTAGGAACTGCCCAGAAGAAAATCCAAGGCTTTAGCCAGGAGCGGAAACT 
TACCATGTAATTATCAAAGTAAAATTC 

30 ATTGCTGATTTTTCAAGACAAAATACTTG 

TATTGAAATGAGCCTACAAAAACCTAGGAAGAGATAACTAGGGAATAATGTATA 
TTATCTTCAAGAAATGTGTGCAGGAATGATTGGTTCTTAGAAATCTCT 
GACITCCCAGACCTGGCAAAGGTTTAGAAACTGTTGCTAAGAA^ 
CCTGAATAAACATGTAATACTCCAGCAGGGATATGAAGCCTCTGAATTGTAGAA 

35 CCTGCATTTATTTGTGACTTTGAACTAAAGACATCCCCCATGTCCCAAAGGT 

AATACAACCAGAGGTCTCATCTCTGAACTTTCTTGCGTACTGATTACATGAGTCT 
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TTGGAGTCGGGGATGGAGGAGGTTCTGCCCCTGTGAGGTGTTATACATGACCAT 
CAAAGTCCTACGTCAAGCT 



5 SEQ ID NO:2 

MAKYFRNNFINPHIYSGGITKLIFCWDFTVTHEKAVKLKQKNLSTEIRENLSELRQENSKLTFNQLLTRFSA 
YMVAWVVSTGVAIACCAAVYYLAEYNLEFLKTHSNPGAVLLLPFVVSCINLAVPCIYSMFRLVERYEMPRHE 
VYVLLI RN I FLKI S 1 1 G I LC YYWLNT VALSGEECWETL I GQDI YRLLLMDFVFSLVNS FLGE FLRRI IGMQL 
10 ITSLGLQEFDIARWLELIYAQTLVWIGIFFCPLLPFIQMIMLFIMFYSKNISLMMNFQPPSKAWRASQMMT 
FFIFLLFFPSFTGVLCTLAITIWRLKPSADCGPFRGLPLFIHSIYSWIDTLSTRPGYLWWWIYRNLIGSVH 
FFFILTLIVLIITYLYWQITEGRKIMIRLLHEQIINEGKDKMFLIEKLIKLQDMEKKANPSSLVLERREVEQ 
QG FLHLGE H DG S LDLRS RRS VQEGN PRA 

15 SEQ ID NO:3 

TGGGGAGGCAGAAGGCAGACTGATCACTTGAGGCCAGGAGTTTGAGACCTCAT 
GTCTAAAAAAAAAAAATTCTGTGAGGTGAGTTTTATTGTTAT^ 

ATATGGAAACTGAGGCTGAGAATCAGAACCATTCACAAGACAAAAATCCCCCAG 
20 TTGGCAGATCCAGGGTTGCAAGCCAGGCCTGTGCAGCCCCAAAACCAGTGCTTG 
TTTAACCACTGTGTGGTGACCACACCGCTCCAGGCCAACAGCTTGGGGCT 
CTTCACGTTGCCITTCACCATTAAATAATAGGGCTGCCCT^ 
ACTCCCAGTGACGGCCATAATAACCTTCAGGTGT 

Catggccaagtatttccggaacaacttcattaatccccacatttactccggagggatcaccaagctgatcttttgctgggacttcactg 
25 tcactcatgaaaaagctgtgaagctaaaacagaagaatcttagcactgagataagggagaacctgtcagagctccgtcaggagaa 
ttccaagttgacgttcaatcagctgctgacccgcttctctgcctacatggtagcctgggttgtctctacaggagtggccatagc 
tgcagccgtttaUacctggctgagtacaacttagagttcctgaagacacacagtaaccctggggcggtgctgttactgcctttcgttgt 
gtcctgcattctggccgtgccatgcatctactccatgttcaggctt^ 

tgatccgcaggggattgatgtagTTCTCAAGTATGGGATGTACAGATGGGCAGGCAGTGCAC 
30 GCACAAAGGCTCCTGGGCTGAGGACGGGACTGAAATCATCCAGCGTTCCCCTTA 
GTCAAGCTAAACATCTTTTTGAAAATAT 

GGCTCAACACCGTGGCCCTGTCTGGTGAAGAGTGTTGGGAAACCCTCATTGGCC 
AGGACATCTACCGGCTCCITCTGATGGATTTTGTGTTCrC 

CCTGGGGGAGTTTCTGAGGAGAATCATTGGGATGCAACTGATCACAAGTCTTGG 
35 CCTTCAGGAGTTTGACATTGCCAGGAACGTTCTAGAACTGATCT 

TCTGGTGTGGATTGGCATCTTCTTCTGCCCCCTGCTGCCCTTTATCCAjVATGAT^ 
ATGCITTTCATCATGTTCTACrrCCAAAuAATATCAGCCTGATGATGAA 
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CTCCGAGCAAAGCCTGGCGGGCCTCACAGATGATGACTTTCTTCATCTTCTTGC 
TCTTTTTCCCATCCTTCACCGGGGTCTTGTGCACCCTGGCCATCACCATCTGGA 
GATTGAAGCCITCAGCTGACTGTGGCCCTTTTCGAGGTCTGCCTCTCTTCATTC 
ACTCCATCTACAGCTGGATCGACACCCTAAGTACACGGCCTGGCTACCTGTGGG 
5 TTGTTTGGATCTATCGGAACCTCATTGGAAGTGTGCACTTCTTTTTCATCCTCAC 
CCTCATTGTGCTAATCATCACCTATCTTTACTGGCAGATCACAGAGGGAAGGAA 
GATTATGATAAGGCTGCTCCATGAGCAGATCATTAATGAGGGCAAAGATAAAAT 
GTTCCTGATAGAAAAATTGATCAAGCTGCAGGATATGGAGAAGAAAGCAAACCC 
CAGCTCACTTGTTCTGGAAAGGAGAGAGGTGGAGCAACAAGGCTTTTTGCATTT 

10 GGGGGAACATGATGGCAGTCTTGACTTGCGATCTAGAAGATCAGTTCAAGAAG 
GTAATCCAAGGGCCTGATGACTCTTTTGGTAACCAGACACCAATCAAATAAGGG 
GAGGAGACGAAAATGGAATGATTTCTTCCATGCCACCTGTGCCTTTAGGAACTG 
CCCAGAAGAAAATCCAAGGCTTTAGCCAGGAGCGGAAACTGACTACCATGTAAT 
TATCAAAGTAAAATTGGGCATTCCATGCTATTTTTAATACCTGGATTGCTGATTT 

1 5 TTCAAGACAAAATACTTGGGGTTTTCCAATAAAGATTGTTGTAATATTGAAATGA 
GCCTACAAAAACCTAGGAAGAGATAACTAGGGAATAATGTATATTATCTTCAAG 
AAATGTGTGCAGGAATGATTGGTTCTTAGAAATCTCTCCTGCCAGACTTCCCAG 
ACCTGGCAAAGGTTTAGAAACTGTTGCTAAGAAAAGTGGTCCATCCTGAATAAA 
CATGTAATACTCCAGCAGGGATATGAAGCCTCTGAATTGTAGAACCTGCATTTA 

20 TTTGTGACTTTGAACTAAAGACATCCCCCATGTCCCAAAGGTGGAATACAACCA 
GAGGTCTCATCTCTGAACTTTCTTGCGTACTGATTACATGAGT Cl ' l ' l GGAGTCG 
GGGATGGAGGAGGTTCTGCCCCTGTGAGGTGTTATACATGACCATCAAAGTCCT 
ACGTCAAGCT 
SEQ ID NO:4 

25 

MAKYFRNNFINPHIYSGGITKI^ 

NSKLTFNQLLTRFSAYMVAWVVSTGVAIACCAAVYYLAEYNLEFLKTHSNPGAVLL 
LPFWSCILAVPCIYSMFRLVERYEMPRHEVYVLLIRRGLM 

30 SEQ ED NO:5 

atcttttgctgggacttcactgtcac'tcatgaaaaagctgtgaagctaaaacagaagaatcttagcactgag 
ataagggagaacctgtcagagctccgtcaggagaattccaagttgacgttcaatcagctgctgacccgcttc 
tctgcctacatggtagcctgggttgtctctacaggagtggccatagcctgctgtgcagccgtttattacctg 
35 gctgagtacaacttagagttcctgaagacacacagtaaccctggggcggtgctgttactgcctttcgttgtg 
tcctgcattaatctggccgtgccatgcatctactccatgttcaggcttgtggagaggtacgagatgccacgg 
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cacgaagtctacgttctcctgatccgaaacatctttttgaaaatatcaatcattggcattctttgttactat 
tggctcaacaccgtggccctgtctggtgaagagtgttgggaaaccctcattggccaggacatctaccggctc 
cttctgatggattttgtgttctctttagtcaattccttcctgggggagtttctgaggagaatcattgggatg 
caactgatcacaagtcttggccttcaggagtttgacattgccaggaacgttctagaactgatctatgcacaa 
5 actctggtgt.ggattggcatcttcttctgccccctgctgccctttatccaaatgattatgcttttcatcatg 
ttctactccaaaaatatcagcctgatgatgaatttccagcctccgagcaaagcctggcgggcctcacagatg 
at gactttctt cat cttcttgctctttttcccatctttcaccggggtcttgtgcaccctggccat caeca tc 
tggagattgaagccttcagctgactgtggcccttttcgaggtctgcctctcttcattcactccatctacagc 
tggatcgacaccctaagtacacggcctggctacctgtgggttgtttggatctatcggaacctcattggaagt 

10 gtgcacttctttttcatcctcaccctcattgtgctgatcatcacctatctttactggcagatcacagaggga 
aggaagattatgataaggctgctccatgagcagatcattaatgagggcaaagataaaatgttcctgatagaa 
aaattgatcaagctgcaggatatggagaagaaagcaaaccccagctcacttgttctggaaaggagagaggtg 
gagcaacaaggctttttgcatttgggggaacatgatggcagtcttgacttgcgatctagaagatcagttcaa 
gaaggtaatccaagggcctgaTGACTCTTTTGGTAACCAGACACCAATCAAATAAGGGGAGGAGATGAAAAT 

15 GGAATGATTTCTTCCATGCCACCTGTGCCTTTAGGAACTGCCCAGAAGAAAATCCAAGGCTTTAGCCAGGAG 
CGGAAACTGACTACCATGTAATTATCAAAGTAAAATTGGGCATTCCATGCTATTTTTAATACCTGGATTGCT 
GATTTTTCAAGACAAAATACTTGGGGTTTTCCAATAAAGATTGTTGTAATATTGAAATGAGCCTACAAAAAC 
CTAGGAAGAGATAACTAGGGAATAATGTATATTATCTTCAAGAAATGTGTGCAGGAATGATTGGTTCTTAGA 
AATCTCTCCTGCCAGACTTCCCAGACCTGGCAAAGGTTTAGAAACTGTTGCTAAGAAAAGTGGTCCATCCTG 

20 AATAAACATGTAATACTCCAGCAGGGATATGAAGCCTCTGAATTGTAGAACCTGCATTTATTTGTGACTTTG 
AACTAAAGACATCCCCCATGTCCCAAAGGTGGAATACAACCAGAGGTCTCATCTCTGAACTTTCTTGCGTAC 
TGATTACATGAGTCTTTGGAGTCGGGGATGGAGGAGGTTCTGCCCCTGTGAGGTGTTATACATGACCATCAA 
AGTCCTACGTCAAGCT 

25 SEQ ID NO:6 

I FCWDFTVTHEKAVKLKQKNLSTE I RENLSELRQENS KLTFNQLLTRFSAYMVAWWSTGVAI ACCAAVYYL 
AEYNLEFLKTHSNPGAVLLLPFWSCINLAVPCIYSMFRLVERYEMPRHEVYVLLIRNIFLKISIIGILCYY 
WLNTVALSGEECWETLIGQDIYRLLLMDFVFSLVNSFLGEFLRRIIGMQLITSLGLQEFDIARNVLELIYAQ 
30 TLVWIGIFFCPLLPFIQMIMLFIMFYSKNISLMMNFQPPSKAWRASQMMTFFIFLLFFPSFTGVLCTLAITI 
WRLKPSADCGPFRGLPLFIHSIYSWIDTLSTRPGYLWVVWIYRNLIGSVHFFFILTLIVLIITYLYWQITEG 
RKI MI RLLHEQI I NEGKDKMFL I EKLI KLQDMEKKAN PS S L VLERRE VEQQG FLHLGEHDGS LDLRS RRS VQ 
EGNPRA 

35 SEQ ID NO:7 

ctgatgatgaatttccagcctccgagcaaagcctggcgggcctcacagatgatgacW^ 
cggggtcttgtgcaccctggccatcaccatctggagattgaagccttcag 

ctccatctacagctggatcgacaccctaagtacacggcctggctacctgtgggttgtttggatctatcggaacctcattggaagtgtgc 
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acttcttmcatcctcaccctcattgtgctgatcatcacctatctttactggcagatcacagagggaaggaagattatgataaggctgc 

tccatgagcagatcattaatgagggcaaagataaaatgttcctgatagaaaaattgatcaagctgcaggatatggagaagaaagc 

aaaccccagctcacttgttctggaaaggagagaggtggagcaacaaggctttttgcatttgggggaacatgatggcagtcttgactt 

gcgatctagaagatcagttcaagaaggtaatccaagggcctgaTGACTCTTTTGGTAACCAGACACCAA^ 

5 CAAATAAGGGGAGGAGATGAAAATGGAATGATTTCTTCCATGCCACCTGTGCCT 

TTAGGAACTGCCCAGAAGAAAATCCAAGGCTTTAGCCAGGAGCGGAAACTGACT 

ACCATGTAATTATCAAAGTAAAATTGGGCATTCCATGCTATTTTTAATACCTGGA 
TTGCTGATTTTTCAAGACAAAATACT 

ATTGAAATGAGCCTACAAAAACCTAGGAAGAGATAACTAGGGAATAATGTATAT 
1 0 TATCTTCAAGAAATGTGTGCAGGAATGATTGGTTCTTAGAAATCTCTCCTGCCA 
GACTTCCCAGACCTGGCAAAGGTTTAGAAACTGTTGCTAAGAAAAGTGGTCCAT 
CCTGAATAAACATGTAATACTCCAGCAGGGATATGAAGCCTCTGAATTGTAGAA 
CCTGCATTTATTTGTGACTTTGAACTAAAGACATCCCCCATGTCCCAAAGGTGG 
AATACAACCAGAGGTCTCATCTCTGAACTTTCTTGCGTACTGATTACATGAGTCT 

15 TTGGAGTCGGGGATGGAGGAGGTTCTGCCCCTGTGAGGTGTTATACATGACCAT 
CAAAGTCCTACGTCAAGCT 

SEQ ID NO:8 

20 LMMNFQPPSKAWRASQMMTFFIFLLFFPSFTGVLCTLAITIWRLKPSADCGPFRGLPLFIHSIYSWIDTLST 
RPGYLWVVWIYRNLIGSVHFFFILTLIVLIITYLYWQITEGRKIMIRLLHEQIINEGKDKMFLIEKLIKLQD 
MEKKANPSSLVLERREVEQQGFLHLGEHDGSLDLRSRRSVQEGNPRA 

SEQ ED NO:9 

25 

ITEGRKIMIR 

SEQBDNOrlO 

30 LLMDFVFSL 

SEQ ro NO: 11 

FLLFFPSFT 

35 

SEQ ID NO:12 

QMMTFFIFL 

40 SEQ ID NO: 13 

MMTFFIFLL 

SEQ ED NO: 14 

45 
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FLIEKLIKL 

SEQ ID NO:15 

VLLIRNIFL 

SEQ ID NO: 16 

LVWIGIFFC 

SEQ ID NO:17 

TLAITIWRL 

15 SEQIDNO:18 

LIFCWDFTV 

SEQ ID NO: 19 

20 

FLGEFLRRI 

SEQ ID NO:20 

25 LLLPFWSC 

SEQ ID NO:21 

LLTRFSAYM 

30 

SEQ ID NO:22 

KLIFCWDFT 

35 SEQ ID NO:23 

LLLMDFVFSL 

SEQ ED NO:24 

40 

LLMDFVFSLV 

SEQ ID NO:25 

45 QMMTFFIFLL 

SEQ ID NO:26 

YLAEYNLEFL 

50 

SEQ ID NO:27 

KLIFCWDFTV 

55 SEQ ID NO:28 

LLFFPSFTGV 
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SEQ ID NO:29 

QLLTRFSAYM 

5 SEQ ED NO:30 

TLVWIGIFFC 

SEQ ID NO:31 

10 

SQMMTFFIFL 

SEQ ID NO:32 

15 VLLLPFWSC 
SEQ ID NO:33 

ILCYYWLNTV 

20 

SEQ ID NO:34 

TLIVLIITYL 

25 SEQ ID NO:35 

YVLLIRNIFL 

SEQ ID NO:36 

30 

FVFSLVNSFL 

SEQ ID NO:37 
35 LLLPFWSCI 
SEQ ID NO:38 

LIGSVHFFFI 

40 

SEQ ID NO:39 

YEMPRHEVYV 

45 SEQ ID NO:40 

LLTRFSAYMV 

SEQ ID NO:41 

50 

IYSGGITKL 
SEQ ID NO:42 
55 VYYLAEYNL 

SEQ ID NO:43 
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YYWLNTVAL 

SEQ ED NO:44 

IYRLLLMDF 

SEQ ID NO:45 

* VYVLLIRNIF 

SEQ ID NO:46 

CYYWLNTVAL 

15 SEQIDNO:47 

YYLAEYNLEF 

SEQ ID NO:48 

20 

MLFIMFYSK 

SEQ ID NO:49 
25 LLIRNIFLK 

SEQ ID NO:50 

GLQEFDIAR 

30 

SEQ ID NO:51 

KMFLIEKLIK 

35 SEQ ID NO:52 

I MLFIMFYSK 

SEQ ID NO:53 

40 

KLIKLQDMEK 

SEQ ID NO:54 

45 VLLIRNIFLK 

SEQ ED NO:55 

MPRHEVYVL 

50 

SEQ ID NO:56 

MPRHEVYVLL 

55 SEQ ID NO:57 

YEMPRHEVY 

70 
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SEQ ED NO:58 

WETLIGQDIY 

5 SEQ ID NO:59 

VHFFFILTL 

SEQ ID NO:60 

10 

LRRIIGMQL 

SEQ ID NO:61 

15 LLMDFVFSL 

SEQ ID NO:62 

MQLITSLGL 

20 

SEQ ID NO:63 

FVFSLVNSF 
25 SEQ ID NO:64 
FILTLIVLI 

SEQ ID NO:65 

30 

WQITEGRKI 

SEQ ID NO:66 

35 MFYSKNISL 

SEQ ID NO:67 

FIMFYSKNI 

40 

SEQ ID NO:68 

WRASQMMTF 

45 SEQ ID NO:69 

FFFILTLIV 

SEQ ID NO:70 

50 

FRLVERYEM 

SEQ ID NO:71 

55 FFILTLIVL 

SEQ ID NO:72 

71 
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Claims 



least 70% 



1 . An isolated polypeptide comprising an amino acid sequence which has at 
identity to the amino acid sequence of SEQ ID NO:2 or SEQ ID NO:4 over, the entire 
length of of SEQ ID NO:2 or SEQ ID NO:4 respectively. 



2. An isolated polypeptide as claimed in claim 1 in which the amino acid sequence has at 
least 95% identity to SEQ ID NO:2 or SEQ ID NO:4. ' 



3. The polypeptide as claimed in claim 1 comprising the amino acid sequence of SEQ ID 
NO:2 or SEQIDNO:4. 



4. The isolated polypeptide of SEQ ID NO:2 or SEQ ID NO:4. 



5 . A polypeptide comprising an immunogenic fragment of a polypeptide as claimed in 
any one of claims 1 to 4 in which the immunogenic fragment (if necessary when coupled 
to a carrier) is capable of raising an immune response which recognises the polypeptide 
of SEQ ID NO:2 or SEQ ID NO:4. 



6. A polypeptide as claimed in any of claims 1 to 5 wherein said polypeptide is part of a 
larger fusion protein. 



7. A polypeptide as claimed in any of claims 1 to 6 chemically conjugated to a carrier 
protein. 



8. An isolated polynucleotide encoding a polypeptide as claimed in any of claims 1 to 6. 

■ 

9. An isolated polynucleotide comprising a nucleotide sequence encoding a polypeptide 
that has at least 70% identity to the amino acid sequence of SEQ ID NO:2 of SEQ ED 
NO:4, over the entire length of SEQ ID NO:2 or SEQ ID NO:4 respectively; or a 
nucleotide sequence complementary to said isolated polynucleotide. 
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10. An isolated polynucleotide comprising a nucleotide sequence that has at least 70% 
identity to a nucleotide sequence encoding a polypeptide of SEQ ID NO:2 or SEQ ID NO:4, 
over the entire coding region; or a nucleotide sequence complementary to sa d isolated 
polynucleotide. 



1 1 . An isolated polynucleotide which comprises a nucleotide sequence which has at least 
70% identity to that of SEQ ID NO:l or SEQ ID NO:3 over the entire length of SEQ ID 
NO:l or SEQ ID NO:3 respectively; or a nucleotide sequence complementary to said 
isolated polynucleotide. 



mm 
mm 



12. The isolated polynucleotide as defined in any one of claims 9 to 11 in which the 
identity is at least 95%. 

13. An isolated polynucleotide selected from: 

(a) a polynucleotide comprising a nucleotide sequence encoding the polypeptide of SEQ 
ID NO:2 or SEQ ED NO:4; 

(b) the coding region of the polynucleotide of SEQ ID NO: 1 or SEQ ID NO:3; and 

(c) a polynucleotide obtainable by screening an appropriate library under stringent 
hybridization conditions with a labeled probe having the sequence of SEQ ID NO:l or SEQ 
ID NO: 3 or a fragment thereof said polynucleotide encoding a protein (if necessary when 
coupled to a carrier) capable of raising an immune response which recognises the protein 



of sequence ID NO:2 or SEQ ID NO:4 respectively or a nucleotide 
complementary to said isolated polynucleotide 



sequence 



14. An expression vector or a recombinant live microorganism comprising an isolated 
polynucleotide according to any one of claims 8-13. 

15. A host cell comprising the expression vector of claim 14 or the isolated 
polynucleotide of claims 8 to 13. 
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16. A process for producing a polypeptide of claims 1 to 7 comprising culturing a host 
cell of claim 15 under conditions sufficient for the production of said polypeptide and 
recovering the polypeptide from the culture medium. 



17. A vaccine comprising an effective amount of the polypeptide of any one of claims 1 
to 7 and a pharmaceutical^ acceptable carrier. 



18. A vaccine comprising an effective amount of the polynucleotide of any one of claims 
8 to 1 3 and a pharmaceutical^ effective carrier. 

1 9. A vaccine comprising an effective amount of antigen presenting cells, modified by in 
vitro loading with a polypeptide of any one of claims 1 to 7, or genetically modified in 
vitro to express a polypeptide of claims 1 to 7 and a pharmaceutical^ effective carrier. 

20. A vaccine as claimed in any one of claims 17 to 19 which additionally comprises a 
TH-1 inducing adjuvant. 

21. A vaccine as claimed in claim 20 in which the TH-1 inducing adjuvant is selected 
from the group of adjuvants comprising: 3D-MPL, QS21, a mixture of <jjS21 and 
cholesterol, and a CpG oligonucleotide. 

22. An antibody immunospecific for the polypeptide or immunological fragment as 
claimed in any one of claims 1 to 5. 

23. A method for screening to identify compounds which stimulate or which inhibit the 
function of the polypeptide of any one of claims 1 to 5 which comprises a method selected 
from the group consisting of: 
(a) measuring the binding of a candidate compound to the said polypeptide (or to the 
cells or membranes bearing the polypeptide) or a fusion protein thereof by means of a 
label directly or indirectly associated with the candidate compound; 
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(b) measuring the binding of a candidate compound to the said polypeptide (or to the 
cells or membranes bearing the polypeptide) or a fusion protein thereof in the presence of 
a labeled competitior; 

(c) testing whether the candidate compound results in a signal generated by activation or 
inhibition of the said polypeptide, using detection systems appropriate to the cells or cell 
membranes bearing the polypeptide; 

(d) mixing a candidate compound with a solution containing a polypeptide of any one of 
claims 1 to 7, to form a mixture, measuring activity of the polypeptide in the mixture, and 
comparing the activity of the mixture to a standard; or 

(e) detecting the effect of a candidate compound on the production of mRNA encoding 
said polypeptide and said polypeptide in cells, using for instance, an ELIS A assay, 

24. A method for the treatment of a subject by immunoprophylaxis or therapy 
comprising in vitro induction of immune responses to a molecule of any one olf claims 1 
to 5, using in vitro incubation of the polypeptide of any one of claims 1 to 7 or the 
polynucleotide of any one of claims 8 to 1 3 with cells from the immune system of a 
mammal, and reinfusing these activated immune cells to the mammal for the treatment of 
disease. 



25. A method as claimed in claim 24 wherein the treatment is for ovarian 



cancer. 



26. An agonist or antagonist to the polypeptide of claims 1 to 5. 



27. A compound which is: 

(a) an agonist or antagonist to the polypeptide of claims 1 to 5; 

(b) isolated polynucleotide of claims 8 to 13; or 

(c) a nucleic acid molecule that modulates the expression of the nucleotide 
encoding the polypeptide of any one of claims 1 to 5; 

for use in therapy. 



or colon 



sequence 
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28. A process for diagnosing a disease or a susceptibility to a disease in a subject related 
to expression or activity of a polypeptide of any one of claims 1 to 5 in a subject 
comprising analyzing for the presence or amount of said polypeptide in a sample derived 
from said subject. 



29. A process for diagnosing a disease or a susceptibility to a disease in a subject 

A 

to expression or activity of a polynucleotide of any one of claims 8 to 13 ir 
comprising analyzing for the presence or amount of said polynucleotide in 
derived from said subject. 



related 
a subject 
a sample 



30. A process for diagnosing the presence of colon cancer or a susceptibility to colon 
cancer in a subject related to expression or activity of a polypeptide of any one of claims 1 
to 5 in a subject comprising analyzing for the presence or amount of said polypeptide in a 
sample derived from said subject. 

31. A process for diagnosing the presence of colon cancer or a susceptibility to colon 
cancer in a subject related to expression or activity of a polynucleotide of any one of 
claims 8 to 13 in a subject comprising analyzing for the presence or amount of said 
polynucleotide in a sample derived from said subject. 



least 70% 
:5 or SEQ 



32. An isolated polynucleotide selected from the group consisting of: 

(a) an isolated polynucleotide comprising a nucleotide sequence which has at 
identity to SEQ ID NO:5 or SEQ ID NO:7 over the entire length of SEQ ID NO 
ID NO: 7 respectively; 

(b) an isolated polynucleotide comprising the polynucleotide of SEQ ID NO:5 or SEQ ID 
NO:7; 

(c) the polynucleotide of SEQ ID NO:5 or SEQ ID NO:7. 



33. A live vaccine composition comprising an expression vector or recombinant live 
micro-organism according to claim 14. 



34. Use of a polynucleotide as claimed in any one of claims 8 to 13 for the manufacture 
of a medicament in the treatment of carcinoma. I 



35. Use of a polynucleotide as claimed in any one of claims 8 to 13 for the manufacture 
of a medicament in the treatment of colon carcinoma. 



36. Use of a polypeptide as claimed in any one of claims 1 to 7 for the manufacture of a 
medicament in the treatment of carcinoma. 



37. Use of a polypeptide as claimed in any one of claims 1 to 7 for the manufacture of a 

> 

medicament in the treatment of colon carcinoma. 
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Figure It RT PCR data of CASB641 1 transcript on colorectal tumours 
and matched normal colon. 
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Legend: 

Ad_Gl: adrenal gland; 
Ao: aorta; Bl: bladder, 

Bo_Ma: bone marrow; 
Bra: brain; 

Ce: cervix; Co: colon; 

Fa_Tu: fallopian tube; He: heart; 

II: ileon; Ki: kidney; Li: liver; 

Lu: lung; Ly_No: lymph node; 

Oe: oesophagus; Ov: ovary; 

PaJThy: parathyroid gland; 

PI: placenta; Pr: prostate; 

Re: rectum; Sk: skin; 

Sk_Mu: skeletal muscle; 

Sm_In: small intestine; 

Sp: spleen; St: stomach; Te: testis; 

Thy: thyroid gland; Tra: trachea; 

Bre: Breast. 



Figure 2 : RT PCR data of CASB7641 1 
transcript in a panel of normal tissues. 
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<130> BC45263 



<160> 72 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 2407 
<212> DNA 

<213> Artificial Sequence 



<400> 1 



tggggaggca 
aaaaaaaaat 
aggctgagaa 
caagccaggc 
ccgctccagg 
gggctgccct 
ctgctttctg 
atttactccg 
aaagctgtga 
ctccgtcagg 
atggtagcct 
ctggctgagt 
ctgcctttcg 
cttgtggaga 
tttttgaaaa 
tctggtgaag 
gattttgtgt 
atgcaactga 
ctgatctatg 
atccaaatga 
ttccagcctc 
ctctttttcc 
aagccttcag 
agctggatcg 
aacctcattg 
tatctttact 
atcattaatg 
atggagaaga 
ggctttttgc 
caagaaggta 
ggaggagacg 
aagaaaatcc 
aattgggcat 

tggggttttc 

gataactagg 
agaaatctct 
aagtggtcca 
tagaacctgc 
atacaaccag 
tcggggatgg 
tcaagct 



gaaggcagac 
tctgtgaggt 
tcagaaccat 
ctgtgcagcc 
ccaacagctt 
ttgttgaagc 
ccttctctag 
gagggatcac 
agctaaaaca 
agaattccaa 
gggttgtctc 
acaacttaga 
ttgtgtcctg 
ggtacgagat 
tatcaatcat 
agtgttggga 
tctctttagt 
tcacaagtct 
cacaaactct 
ttatgctttt 
cgagcaaagc 
catccttcac 
ctgactgtgg 
acaccctaag 
gaagtgtgca 
ggcagatcac 
agggcaaaga 
aagcaaaccc 
atttggggga 
atccaagggc 
aaaatggaat 
aaggctttag 
tccatgctat: 
caataaagat 
gaataatgta 
cctgccagac 
tcctgaataa 
atttatttgt 
aggtctcatc 
aggaggttct 



tgatcacttg 
gagttttatt 
tcacaagaca 
ccaaaaccag 
ggggctaagt 
cctgcactcc 
catggccaag 
caagctgatc 
gaagaatctt 
gttgacgttc 
tacaggagtg 
gttcctgaag 
cattaatctg 
gccacggcac 
tggcattctt 
aaccctcatt 
caattccttc 
tggccttcag 
ggtgtggatt 
catcatgttc 
ctggcgggcc 
cggggtcttg 
cccttttcga 
tacacggcct 
cttctttttc 
agagggaagg 
taaaatgttc 
cagctcactt 
acatgatggc 
ctgatgactc 
gatttcttcc 
ccaggagcgg 
ttttaatacc 
tgttgtaata 
tattatcttc 
ttcccagacc 
acatgtaata 
gactttgaac 
tctgaacttt 
gcccctgtga 



aggccaggag 
gttattccct 
aaaatccccc 
tgcttgttta 
cttcacgttg 
cagtgacggc 
tatttccgga 
ttttgctggg 
agcactgaga 
aatcagctgc 
gccatagcct 
acacacagta 
gccgtgccat 
gaagtctacg 
tgttactatt 
ggccaggaca 
ctgggggagt 
gagtttgaca 
ggcatcttct 
tactccaaaa 
tcacagatga 
tgcaccctgg 
ggtctgcctc 
ggctacctgt 
atcctcaccc 
aagattatga 
ctgatagaaa 
gttctggaaa 
agtcttgact 
ttttggtaac 
atgccacctg 
aaactgacta 
tggattgctg 
ttgaaatgag 
aagaaatgtg 
tggcaaaggt 
ctccagcagg 
taaagacatc 
cttgcgtact 
ggtgttatac 



tttgagacct 
ctctacagat 
agttggcaga 
accactgtgt 
cctttcacca 
cataataacc 
acaacttcat 
acttcactgt 
taagggagaa 
tgacccgctt 
gctgtgcagc 
accctggggc 
gcatctactc 
ttctcctgat 
ggctcaacac 
tctaccggct 
ttctgaggag 
ttgccaggaa 
tctgccccct 
atatcagcct 
tgactttctt 
ccatcaccat 
tcttcattca 
gggttgtttg 
tcatrgtgct 
taaggctgct 
aattgatcaa 
ggagagaggt 
tgcgatctag 
cagacaccaa 
tgcctttagg 
ccatgtaatt 
atttttcaag 
cctacaaaaa 
tgcaggaatg 
ttagaaactg 
gatatgaagc 
ccccatgtcc 
gattacatga 
atgaccatca 



catgtctaaa 
atggaaactg 
tccagggttg 
ggtgaccaca 
ttaaataata 
ttcaggtgtt 
taatccccac 
cactcatgaa 
cctgtcagag 
ctctgcctac 
cgtttattac 
ggtgctgtta 
catgttcagg 
ccgaaacatc 
cgtggccctg 
ccttctgatg 
aatcattggg 
cgttctagaa 
gctgcccttt 
gatgatgaat 
catcttcttg 
ctggagattg 
ctccatctac 
gatctatcgg 
aatcatcacc 
ccatgagcag 
gctgcaggat 
ggagcaacaa 
aagatcagtt 
tcaaataagg 
aactgcccag 
atcaaagtaa 
acaaaatact 
cctaggaaga 
attggttctt 
ttgctaagaa 
ctctgaattg 
caaaggtgga 
gtctttggag 
aagtcctacg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2407 



65 



<210> 2 
<211> 460 
<212> PRT 

<213> Artificial Sequence 



1 



nil i\ n ,i* 'ij »i* ii ,i- h — ,}■ « ii - 4i 5 ir"^H 1! 'I d< *-« «il ft 1? 

*t,d„ U Ji»..ii JJ...H ..si' .- il„.f' i:: il„ T T' » ^.^ii ~,ti.< ;L™. 

WO 01/23417 PCT/EP00/09500 







<400> 


2 

>*— 




























Met 


Ala 


Lys 


Tvr 

jr 


Phe 


Ara 


Asn 


Asn 


Phe 


He 


Asn 


Pro 

A» A 


His 


He 


Tvr 

* Jr *> 


Ser 




1 








5 










10 










15 




5 

— / 


Glv 


Gly 


He 


Thr 


Lvs 


Leu 


He 


Phe 


Cvs 


Trp Asp 


Phe 

A A 


Thr 


Val 


Thr 

^ AAA. 


His 










20 










25 










30 








Glu 

\j -Jt w 


Lys 


Ala 


Val 


Lvs 


Leu 


Lvs 


Gin 

A A 


Lvs 


Asn 


Leu 


Ser 


Thr 


Glu 


He 


Arcr 

* * A- y 








35 










40 










45 










Glu 


Asn 


Leu 


Ser 


Glu 


Leu 


Arg 


Gin 


Glu 


Asn 


Ser 


Lvs 


Leu 


Thr 


Phe 


Asn 

A A*a? «. A. 


10 




50 










55 










60 












Gin 

\— / -A_ & 4 


Leu 


Leu 


Thr 

* ** A- 


Arg 


Phe 


Ser 


Ala 


Tvr 

Jr 


Met 


Val 


Ala 


Trp 


Val 


Val 

w (A 


Ser 

W A- 




65 










70 










75 










80 




Thr 

« * a a- 


Gly 


Val 


Ala 


He 


Ala 


Cvs 


Cvs 

^ jr 


Ala 


Ala 


Val 


Tvr 

Jr 


Tyr 


Leu 


Ala 


Glu 

-A. U 












85 










90 










95 




15 


Tvr 

Jr 


Asn 


Leu 


Glu 


Phe 


Leu 


LVS 
jr 


Thr 


His 


Ser 


Asn 


Pro 


Gly Ala 


Val 

▼ *-A «A» 


Leu 

X-J W u 










100 

<X. v v 










105 










110 








Leu 


Leu 


Pro 


Phe 

At X X W 


Val 


Val 


Ser 

*— ' ^— » A. 


Cvs 


He 


Asn 


Leu 


Ala 


Val 


Pro 




Tie 








115 










120 










125 










Tvr 


Ser 


Met 


Phe 


Arg 


Leu 


Val 


Glu 


Ara 


Tvr 

Jr 


Glu 


Met 


Pro 


Arg 


His 


Glu 

\J ^ La 


20 




130 










135 










140 












Val 


Tvr 


Val 


Leu 


Leu 


He 


Ara 

A A A SJ 


Asn 

4 fcw A A 


He 


Phe 


Leu 

Ai^ VA 


Lys 


He 


Ser 


He 

-A. -JL. \_- 


He 




145 










150 










155 










160 




Glv 


He 


Leu 


Cvs 


Tvr 

Jr 


Tvr 

Jr 


Trrj 


Leu 

AJV V^- 


Asn 

A*w XX 


Thr 

A* AA A- 


Val 

w ^-A 


Ala 


Leu 


Ser 


Glv 


Glu 

VJ 1 u 












165 










170 

-A- • 










175 




25 


Glu 

w X IX 


Cvs 


Trp 


Glu 


Thr 


Leu 


He 

<A» 


Glv 


Gin 

SJ -A. A A 


Asn 


He 

-A- ^ ^ 


Tyr Arg 


Leu 


Leu 


Le\3 










180 










185 

KJ -m^ 










190 








Met 


Asp 


Phe 


Val 


Phe 

A> A A V^> 


Ser 


A. J Vv> U 


Val 

V U -A. 


Asn 

n uSt XX 


Ser 


Phe 

A, A A 


Leu 


Gly 


Glu 


Phe 


T.P»n 
ucu 








195 










200 










205 










Mi y 


Ara 


He 


Tie 

IXC 


Glv 


Met" 


Gin 


T,pt1 


Tie 


Thr 

X 4lA> 


«*J w XV 


Leu 


Gly 


Leu 




Glu 


30 




210 










215 
-i- ~j 










220 












Phe 


Asd 


He 

A 


Ala 




Asn 


Val 


Leu 


Glu 

V* J» 


Leu 

Arf M 


He 


Tyr 


Ala 


Gin 


Thr 


T,pn 
UC LI 




£. -J 










t. J w 










235 














Val 




He 


Glv 


He 


Phe 


Phe 


Cvs 


Pro 


Aj\i IX 


Leu 


Pro 


Phe 


He 


Gin 


Met 












245 










250 










£. J J 




35 


lie 

^k> 


Met 


Leu 


Phe 


He 


Met 

A A W 


Phe 


Tyr 


Ser 


Lvs 


Asn 


He 


Ser 


Leu 


Met 


Met 










2 60 










265 










270 










Phe 


Gin 


Prn 

C -L. \J 




Cp r* 

W C Am 


Lv«5 


Ala 


Trp 


Ara 


Ala 

AA^U LAi 


Ser 


Gin 


Met 




nil 








275 










280 










285 










Phe 

A> • a « 


Phe 


He 


Phe 


Leu 


Lpu 

W VpX 


Phe 

A> If W 


Phe 


Pro 


Ser 


Phe 

A> A A ^« 


Thr 


Gly Val 


Leu 


v<* jr o 


40 




290 










295 










300 










Thr 

A 44 Ab 


Leu 


Ala 


He 


Thr 

A AA A> 


He 

>i» 


Tro 


Arg 


Leu 


aJ j O 


Pro 

X» A. \^ 


Ser 


Ala 


Asp 


Cys 


Glv 




305 










310 










315 










"?70 

J \J 




Pro 

A» A» V* 


Phe 


Aro 


Glv 


Leu 


Pro 

A* Ai 


Leu 


Phe 


He 


His 


Ser 


He 


Tyr 


Ser 


Trp 


He 
1 1 w 












325 










330 










335 




45 


Asd 


Thr 


Leu 


fier 

4j W <i- 


Thr 

X XX a. 


Ara 


Pro 

A- A» V-^ 


Gly Tyr 


Leu 

AJ& U 


Tm 


Val 


Val 


Trp 


He 


iyr 










340 

•f w 










345 










350 








Aro 


Asn 


Leu 


He 


Glv 


Ser 


Val 


His 


Phe 


Phe 


Phe 


He 


Leu 


Thr 


Leu 


He 








355 










360 










365 








50 


Val 

W u -Ah 


Leu 


Tie 


Tie 


Thr 

X 11 A> 


Tvr 

Jr 


Leu 


Tyr 


Trp 


Gin 


He 


Thr 


Glu Gly Arg 






370 




















380 












He 


Met 


He 


Arg 


Leu 


Leu 


His 


Glu 


Gin 


He 


He 


Asn 


Glu 


Gly 


Lys 


Asp 




385 










390 










395 










400 




Lys 


Met 


Phe 


Leu 


He 


Glu 


Lys 


Leu 


He 


Lys 


Leu 


Gin 


Asp 


Met 


Glu 


Lys 












405 










410 










415 




55 


Lys 


Ala 


Asn 


Pro 


Ser 


Ser 


Leu 


Val 


Leu 


Glu 


Arg 


Arg 


Glu 


Val 


Glu 


Gin 










420 










425 










430 








Gin 


Gly 


Phe 


Leu 


His 


Leu 


Gly 


Glu 


His 


Asp 


Gly 


Ser 


Leu 


Asp 


Leu 


Arg 








435 










440 










445 










Ser 


Arg 


Arg 


Ser 


Val 


Gin 


Glu 


Gly Asn 


Pro 


Arg 


Ala 










60 




450 










455 










4 60 











<210> 3 
<211> 2521 
<212> DNA 

65 <213> Artificial Sequence 



2 
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<400> 3 

tggggaggca gaaggcagac tgatcacttg aggccaggag tttgagacct catgtctaaa 60 

aaaaaaaaat tctgtgaggt gagttttatt gttattccct ctctacagat atggaaactg 120 

aggctgagaa tcagaaccat tcacaagaca aaaatccccc agttggcaga tccagggttg 180 

5 caagccaggc ctgtgcagcc ccaaaaccag tgcttgttta accactgtgt ggtgaccaca 240 

ccgctccagg ccaacagctt ggggctaagt cttcacgttg cctttcacca ttaaataata 300 

gggctgccct ttgttgaagc cctgcactcc cagtgacggc cataataacc ttcaggtgtt 360 

ctgctttctg ccttctctag catggccaag tatttccgga acaacttcat taatccccac 420 

atttactccg gagggatcac caagctgatc ttttgctggg acttcactgt cactcatgaa 4 80 

10 aaagctgtga agctaaaaca gaagaatctt agcactgaga taagggagaa cctgtcagag 54 0 

ctccgtcagg agaattccaa gttgacgttc aatcagctgc tgacccgctt ctctgcctac 600 

atggtagcct gggttgtctc tacaggagtg gccatagcct gctgtgcagc cgtttattac 660 

ctggctgagt acaacttaga gttcctgaag acacacagta accctggggc ggtgctgtta 720 

ctgcctttcg ttgtgtcctg cattctggcc gtgccatgca tctactccat gttcaggctt 780 

15 gtggagaggt acgagatgcc acggcacgaa gtctacgttc tcctgatccg caggggattg 840 

atgtagttct caagtatggg atgtacagat gggcaggcag tgcacgcaca aaggctcctg 900 

ggctgaggac gggactgaaa tcatccagcg ttccccttag tcaagctaaa catctttttg 960 

aaaatatcaa tcattggcat tctttgttac tattggctca acaccgtggc cctgtctggt 1020 

gaagagtgtt gggaaaccct cattggccag gacatctacc ggctccttct gatggatttt 1080 

20 gtgttctctt tagtcaattc cttcctgggg gagtttctga ggagaatcat tgggatgcaa 1140 

ctgatcacaa gtcttggcct tcaggagttt gacattgcca ggaacgttct agaactgatc 1200 

tatgcacaaa ctctggtgtg gattggcatc ttcttctgcc ccctgctgcc ctttatccaa 1260 

atgattatgc ttttcatcat gttctactcc aaaaatatca gcctgatgat gaatttccag 1320 

cctccgagca aagcctggcg ggcctcacag atgatgactt tcttcatctt cttgctcttt 1380 

25 ttcccatcct tcaccggggt cttgtgcacc ctggccatca ccatctggag attgaagcct 1440 

tcagctgact gtggcccttt tcgaggtctg cctctcttca ttcactccat ctacagctgg 1500 

atcgacaccc taagtacacg gcctggctac ctgtgggttg tttggatcta tcggaacctc 1560 

attggaagtg tgcacttctt tttcatcctc accctcattg tgctaatcat cacctatctt 1620 

tactggcaga tcacagaggg aaggaagatt atgataaggc tgctccatga gcagatcatt 1680 

30 aatgagggca aagataaaat gttcctgata gaaaaattga tcaagctgca ggatatggag 1740 

aagaaagcaa accccagctc acttgttctg gaaaggagag aggtggagca acaaggcttt 1800 

ttgcatttgg gggaacatga tggcagtctt gacttgcgat ctagaagatc agttcaagaa 1860 

ggtaatccaa gggcctgatg actcttttgg taaccagaca ccaatcaaat aaggggagga 1920 

gacgaaaatg gaatgatttc ttccatgcca cctgtgcctt taggaactgc ccagaagaaa 1980 

35 atccaaggct ttagccagga gcggaaactg actaccatgt aattatcaaa gtaaaattgg 2040 

gcattccatg ctatttttaa tacctggatt gctgattttt caagacaaaa tacttggggt 2100 

tttccaataa agattgttgt aatattgaaa tgagcctaca aaaacctagg aagagataac 2160 

tagggaataa tgtatattat cttcaagaaa tgtgtgcagg aatgattggt tcttagaaat 2220 

ctctcctgcc agacttccca gacctggcaa aggtttagaa actgttgcta agaaaagtgg 2280 

40 tccatcctga ataaacatgt aatactccag cagggatatg aagcctctga attgtagaac 2340 

ctgcatttat ttgtgacttt gaactaaaga catcccccat gtcccaaagg tggaatacaa 2400 

ccagaggtct catctctgaa ctttcttgcg tactgattac atgagtcttt ggagtcgggg 24 60 

atggaggagg ttctgcccct gtgaggtgtt atacatgacc atcaaagtcc tacgtcaagc 2520 

t ' " 2521 



45 

<210> 4 
<211> 154 
<212> PRT 

<213> Artificial Sequence 

50 







<400> 


4 




























Met 


Ala 


Lys 


Tyr 


Phe 


Arg 


Asn 


Asn 


Phe 


lie 


Asn 


Pro 


His 


He 


Tyr 


Ser 




1 








5 










10 










15 






Gly 


Gly 


He 


Thr 


Lys 


Leu 


He 


Phe 


Cys 


Trp 


Asp 


Phe 


Thr 


Val 


Thr 


His 


55 








20 










25 










30 








Glu 


Lys 


Ala 
35 


Val 


Lys 


Leu 


Lys 


Gin 
40 


Lys 


Asn 


Leu 


Ser 


Thr 
45 


Glu 


He 


Arg 




Glu 


Asn 
50 


Leu 


Ser 


Glu 


Leu 


Arg 
55 


Gin 


Glu 


Asn 


Ser 


Lys 
60 


Leu 


Thr 


Phe 


Asn 


60 


Gin 
65 


Leu 


Leu 


Thr 


Arg 


Phe 
70 


Ser 


Ala 


Tyr 


Met 


Val 
75 


Ala 


Trp 


Val 


Val 


Ser 
80 




Thr 


Gly 


Val 


Ala 


He 
85 


Ala 


Cys 


Cys 


Ala 


Ala 
90 


Val 


Tyr 


Tyr 


Leu 


Ala 
95 


Glu 




Tyr 


Asn 


Leu 


Glu 


Phe 


Leu 


Lys 


Thr 


His 


Ser 


Asn 


Pro 


Gly 


Ala 


Val 


Leu 


65 








100 










105 










110 








Leu 


Leu 


Pro 


Phe 


Val 


Val 


Ser 


Cys 


He 


Leu 


Ala 


Val 


Pro 


Cys 


He 


Tyr 



3 
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WO 01/23417 
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10 



115 120 125 

Ser Met Phe Arg Leu Val Glu Arg Tyr Glu Met Pro Arg His Glu Val 

130 135 140 

Tyr Val Leu Leu lie Arg Arg Gly Leu Met 
145 150 

<210> 5 
<211> 1960 
<212> DNA 

<213> Artificial Sequence 



15 



20 



25 



30 



35 



40 



45 



<400 
atcttttgct 
cttagcactg 
ttcaatcagc 
gtggccatag 
aagacacaca 
ctggccgtgc 
cacgaagtct 
ctttgttact 
attggccagg 
ttcctggggg 
caggagtttg 
attggcatct 
ttctactcca 
gcctcacaga 
ttgtgcaccc 
cgaggtctgc 
cctggctacc 
ttcatcctca 
aggaagatta 
ttcctgatag 
cttgttctgg 
ggcagtcttg 
ctcttttggt 
tccatgccac 
cggaaactga 
acctggattg 
atattgaaat 
ttcaagaaat 
acctggcaaa 
atactccagc 
aactaaagac 
tttcttgcgt 
tgaggtgtta 



> 5 
gggacttcac 
agataaggga 
tgctgacccg 
cctgctgtgc 
gtaaccctgg 
catgcatcta 
acgttctcct 
attggctcaa 
acatctaccg 
agtttctgag 
acattgccag 
tcttctgccc 
aaaatatcag 
tgatgacttt 
tggccatcac 
ctctcttcat 
tgtgggttgt 
ccctcattgt 
tgataaggct 
aaaaattgat 
aaaggagaga 
acttgcgatc 
aaccagacac 
ctgtgccttt 
ctaccatgta 
ctgatttttc 
gagcctacaa 
gtgtgcagga 
ggtttagaaa 
agggatatga 
atcccccatg 
actgattaca 
tacatgacca 



tgtcactcat 
gaacctgtca 
cttctctgcc 
agccgtttat 
ggcggtgctg 
ctccatgttc 
gatccgaaac 
caccgtggcc 
gctccttctg 
gagaatcatt 
gaacgttcta 
cctgctgccc 
cctgatgatg 
cttcatcttc 
catctggaga 
tcactccatc 
ttggatctat 
gctgatcatc 
gctccatgag 
caagctgcag 
ggtggagcaa 
tagaagatca 
caatcaaata 
aggaactgcc 
attatcaaag 
aagacaaaat 
aaacctagga 
atgattggtt 
ctgttgctaa 
agcctctgaa 
tcccaaaggt 
tgagtctttg 
tcaaagtcct 



gaaaaagctg 

gagctccgtc 

tacatggtag 

tacctggctg 

ttactgcctt 

aggcttgtgg 

atctttttga 

ctgtctggtg 

atggattttg 

gggatgcaac 

gaactgatct 

tttatccaaa 

aatttccagc 

ttgctctttt 

ttgaagcctt 

tacagctgga 

cggaacctca 

acctatcttt 

cagatcatta 

gatatggaga 

caaggctttt 

gttcaagaag 

aggggaggag 

cagaagaaaa 

taaaattggg 

acttggggtt 

agagataact 

cttagaaatc 

gaaaagtggt 

ttgtagaacc 

ggaatacaac 

gagtcgggga 

acgtcaagct 



tgaagctaaa 
aggagaattc 
cctgggttgt 
agtacaactt 
tcgttgtgtc 
agaggtacga 
aaatatcaat 
aagagtgttg 
tgttctcttt 
tgatcacaag 
atgcacaaac 
tgattatgct 
ctccgagcaa 
tcccatcttt 
cagctgactg 
tcgacaccct 
ttggaagtgt 
actggcagat 
atgagggcaa 
agaaagcaaa 
tgcatttggg 
gtaatccaag 
atgaaaatgg 
tccaaggctt 
cattccatgc 
ttccaataaa 
agggaataat 
tctcctgcca 
ccatcctgaa 
tgcatttatt 
cagaggtctc 
tggaggaggt 



acagaagaat 
caagttgacg 
ctctacagga 
agagttcctg 
ctgcattaat 
gatgccacgg 
cattggcatt 
ggaaaccctc 
agtcaattcc 
tcttggcctt 
tctggtgtgg 
tttcatcatg 
agcctggcgg 
caccggggtc 
tggccctttt 
aagtacacgg 
gcacttcttt 
cacagaggga 
agataaaatg 
ccccagctca 
ggaacatgat 
ggcctgatga 
aatgatttct 
tagccaggag 
tatttttaat 
gattgttgta 
gtatattatc 
gacttcccag 
taaacatgta 
tgtgactttg 
atctctgaac 
tctgcccctg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1960 



50 



<210> 6 
<211> 438 
<212> PRT 

<213> artifical sequence 



55 



60 



65 



He 

1 
Lys 

Arg 

Ser 

Cys 

65 

Lys 

Ser 



<400> 
Phe Cys 

Gin Lys 

Gin Glu 
35 

Ala Tyr 
50 

Cys Ala 
Thr His 
Cys He 



Trp Asp Phe 
5 

Asn Leu Ser 
20 

Asn Ser Lys 

Met Val Ala 

Ala Val Tyr 
70 

Ser Asn Pro 
85 

Asn Leu Ala 
100 



Thr Val Thr His 

10 

Thr Glu He Arg 
25 

Leu Thr Phe Asn 
40 

Trp Val Val Ser 
55 

Tyr Leu Ala Glu 

Gly Ala Val Leu 

90 

Val Pro Cys He 
105 



Glu Lys Ala 

Glu Asn Leu 

Gin Leu Leu 
45 

Thr Gly Val 
60 

Tyr Asn Leu 
75 

Leu Leu Pro 
Tyr Ser Met 



Val Lys Leu 
15 

Ser Glu Leu 
30 

Thr Arg Phe 

Ala He Ala 

Glu Phe Leu 
80 

Phe Val Val 
95 

Phe Arg Leu 
110 



4 



-„;i„ H,*3 l si»«r n.„3* ,,„,;! «j lk*Jt 3i™J- „4i„ 
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Val Glu Arg Tyr Glu Met Pro Arg His Glu Val Tyr Val Leu Leu lie 

115 120 125 

Arg Asn lie Phe Leu Lys lie Ser lie He Gly He Leu Cys Tyr Tyr 
130 135 140 

5 Trp Leu Asn Thr Val Ala Leu Ser Gly Glu Glu Cys Trp Glu Thr Leu 
145 150 155 160 

He Gly Gin Asp He Tyr Arg Leu Leu Leu Met Asp Phe Val Phe Ser 

165 170 175 

Leu Val Asn Ser Phe Leu Gly Glu Phe Leu Arg Arg He He Gly Met 
10 180 185 190 

Gin Leu He Thr Ser Leu Gly Leu Gin Glu Phe Asp He Ala Arg Asn 

195 200 205 

Val Leu Glu Leu He Tyr Ala Gin Thr Leu Val Trp He Gly lie Phe 
210 215 220 

15 Phe Cys Pro Leu Leu Pro Phe He Gin Met He Met Leu Phe He Met 
225 230 235 240 

Phe Tyr Ser Lys Asn He Ser Leu Met Met Asn Phe Gin Pro Pro Ser 

245 250 255 

Lys Ala Trp Arg Ala Ser Gin Met Met Thr Phe Phe He Phe Leu Leu 
20 260 265 270 

Phe Phe Pro Ser Phe Thr Gly Val Leu Cys Thr Leu Ala He Thr He 

275 280 285 

Trp Arg Leu Lys Pro Ser Ala Asp Cys Gly Pro Phe Arg Gly Leu Pro 
290 295 300 

25 Leu Phe He His Ser He Tyr Ser Trp He Asp Thr Leu Ser Thr Arg 
305 310 315 320 

Pro Gly Tyr Leu Trp Val Val Trp He Tyr Arg Asn Leu He Gly Ser 

325 330 335 

Val His Phe Phe Phe He Leu Thr Leu lie Val Leu He He Thr Tyr 
30 340 345 350 

Leu Tyr Trp Gin He Thr Glu Gly Arg Lys He Met He Arg Leu Leu 

355 360 365 

His Glu Gin He He Asn Glu Gly Lys Asp Lys Met Phe Leu lie Glu 
370 375 380 

35 Lys Leu He Lys Leu Gin Asp Met Glu Lys Lys Ala Asn Pro Ser Ser 
385 390 395 400 

Leu Val Leu Glu Arg Arg Glu Val Glu Gin Gin Gly Phe Leu His Leu 

405 410 415 

Gly Glu His Asp Gly Ser Leu Asp Leu Arg Ser Arg Arg Ser Val Gin 
40 420 425 430 

Glu Gly Asn Pro Arg Ala 
435 

<210> 7 
45 <21i> 1219 

<212> DNA 

<213> Artificial Sequence 
<400> 7 

50 ctgatgatga atttccagcc tccgagcaaa gcctggcggg cctcacagat gatgactttc 60 

ttcatcttct tgctcttttt cccatctttc accggggtct tgtgcaccct ggccatcacc 120 

atctggagat tgaagccttc agctgactgt ggcccttttc gaggtctgcc tctcttcatt 180 

cactccatct acagctggat cgacacccta agtacacggc ctggctacct gtgggttgtt 240 

tggatctatc ggaacctcat tggaagtgtg cacttctttt tcatcctcac cctcattgtg 300 

55 ctgatcatca cctatcttta ctggcagatc acagagggaa ggaagattat gataaggctg 360 

ctccatgagc agatcattaa tgagggcaaa gataaaatgt tcctgataga aaaattgatc 420 

aagctgcagg atatggagaa gaaagcaaac cccagctcac ttgttctgga aaggagagag 4 80 

gtggagcaac aaggcttttt gcatttgggg gaacatgatg gcagtcttga cttgcgatct 54 0 

agaagatcag ttcaagaagg taatccaagg gcctgatgac tcttttggta accagacacc 600 

60 aatcaaataa ggggaggaga tgaaaatgga atgatttctt ccatgccacc tgtgccttta 660 

ggaactgccc agaagaaaat ccaaggcttt agccaggagc ggaaactgac taccatgtaa 720 

ttatcaaagt aaaattgggc attccatgct atttttaata cctggattgc tgatttttca 780 

agacaaaata cttggggttt tccaataaag attgttgtaa tattgaaatg agcctacaaa 840 

aacctaggaa gagataacta gggaataatg tatattatct tcaagaaatg tgtgcaggaa 900 

65 tgattggttc ttagaaatct ctcctgccag acttcccaga cctggcaaag gtttagaaac 960 

tgttgctaag aaaagtggtc catcctgaat aaacatgtaa tactccagca gggatatgaa 1020 



5 



10 



45 



50 



.^iL, ILrJi H..,}' .J 1 i - ii,,.}' ,.„,>i ju 45„.ji ,fc ti,,,! 1 .„>L. 11.,... 

WO 01/23417 PCTpPOO/09500 

gcctctgaat tgtagaacct gcatttattt gtgactttga actaaagaca tcccccatgt 1080 

cccaaaggtg gaatacaacc agaggtctca tctctgaact ttcttgcgta ctgattacat 1140 

gagtctttgg agtcggggat ggaggaggtt ctgcccctgt gaggtgttat acatgaccat 1200 

caaagtccta cgtcaagct 1219 

<210> 8 
<211> 191 
<212> PRT 

<213> Artificial Sequence 



<400> 8 

Leu Met Met Asn Phe Gin Pro Pro Ser Lys Ala Trp Arg Ala Ser Gin 

15 10 15 

Met Met Thr Phe Phe He Phe Leu Leu Phe Phe Pro Ser Phe Thr Gly 
15 20 25 30 

Val Leu Cys Thr Leu Ala lie Thr He Trp Arg Leu Lys Pro Ser Ala 

35 40 45 

Asp Cys Gly Pro Phe Arg Gly Leu Pro Leu Phe He His Ser He Tyr 
50 55 60 

20 Ser Trp He Asp Thr Leu Ser Thr Arg Pro Gly Tyr Leu Trp Val Val 
65 70 75 80 

Trp He Tyr Arg Asn Leu He Gly Ser Val His Phe Phe Phe He Leu 

85 90 95 

Thr Leu He Val Leu He He Thr Tyr Leu Tyr Trp Gin He Thr Glu 
25 100 105 110 

Gly Arg Lys He Met He Arg Leu Leu His Glu Gin He He Asn Glu 

115 120 125 

Gly Lys Asp Lys Met Phe Leu He Glu Lys Leu He Lys Leu Gin Asp 
130 135 140 

30 Met Glu Lys Lys Ala Asn Pro Ser Ser Leu Val Leu Glu Arg Arg Glu 
145 150 155 160 

Val Glu Gin Gin Gly Phe Leu His Leu Gly Glu His Asp Gly Ser Leu 

165 170 175 

Asp Leu Arg Ser Arg Arg Ser Val Gin Glu Gly Asn Pro Arg Ala 
35 180 185 190 

<210> 9 
<211> 10 
<212> PRT 

40 <213> Artificial Sequence 

<400> 9 

He Thr Glu Gly Arg Lys He Met He Arg 
15 10 

<210> 10 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<400> 10 

Leu Leu Met Asp Phe Val Phe Ser Leu 

1 5 

55 <210> 11 

<211> 9 
<212> PRT 

<213> Artificial Sequence 

60 <400> 11 

Phe Leu Leu Phe Phe Pro Ser Phe Thr 
1 5 

<210> 12 
65 <211> 9 

<212> PRT 



6 



WO 01/23417 



20 



<213> Artificial Sequence 

<400> 12 

Gin Met Met Thr Phe Phe lie Phe Leu 
5 1 5 

<210> 13 

<211> 9 

<212> PRT 

10 <213> Artificial Sequence 

<400> 13 

Met Met Thr Phe Phe lie Phe Leu Leu 
1 5 

15 

<210> 14 

<211> 9 

<212> PRT 

<213> Artificial Sequence 

<400> 14 

Phe Leu lie Glu Lys Leu lie Lys Leu 
1 5 

25 <210> 15 

<211> 9 
<212> PRT 

<213> Artificial Sequence 

30 <400> 15 

Val Leu Leu lie Arg Asn lie Phe Leu 

1 5 

<210> 16 
35 <211> 9 

<212> PRT 

<213> Artificial Sequence 
<400> 16 

40 Leu Val Trp lie Gly lie Phe Phe Cys 
1 5 

<210> 17 
<211> 9 
45 <212> PRT 

<213> Artificial Sequence 

<400> 17 

Thr Leu Ala lie Thr lie Trp Arg Leu 
50 1 5 

<210> 18 
<211> 9 
<212> PRT 

55 <213> Artificial Sequence 

<400> 18 

Leu He Phe Cys Trp Asp Phe Thr Val 
1 * 5 



60 



65 



<210> 19 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<400> 19 
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Phe Leu Gly Glu Phe Leu Arg Arg lie 
1 5 



10 



<210> 20 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<400> 20 

Leu Leu Leu Pro Phe Val Val Ser Cys 
1 5 



15 



<210> 21 
<211> 9 
<212> PRT 

<213> Artificial Sequence 



20 



<400> 21 
Leu Leu Thr Arg Phe 

1 5 



Ser Ala Tyr Met 



25 



30 



35 



<210> 22 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<400> 22 

Lys Leu lie Phe Cys Trp Asp Phe Thr 
1 5 

<210> 23 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<400> 23 

Leu Leu Leu Met Asp Phe Val Phe Ser Leu 
15 10 



40 



45 



<210> 24 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<400> 24 

Leu Leu Met Asp Phe Val Phe Ser Leu Val 
1 5 10 



50 



55 



<210> 25 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<400> 25 

Gin Met Met Thr Phe Phe lie Phe Leu Leu 
15 10 



60 



<210> 26 
<211> 10 
<212> PRT 

<213> Artificial Sequence 



65 



<400> 26 

Tyr Leu Ala Glu Tyr Asn Leu Glu Phe Leu 
15 10 



8 
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<210> 27 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<400> 27 

Lys Leu lie Phe Cys Trp Asp Phe Thr Val 
15 10 



10 



15 



<210> 
<211> 
<212> 
<213> 



28 
10 
PRT 

Artificial 



Sequence 



<400> 28 

Leu Leu Phe Phe Pro Ser Phe Thr Gly Val 
1 5 10 



20 



25 



<210> 29 
<211> 10 
<212> PRT 
<213> Artificial 



Sequence 



<400> 29 
Gin Leu Leu Thr Arg 
1 5 



Phe Ser Ala Tyr 



Met 
10 



30 



35 



<210> 30 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<400> 30 

Thr Leu Val Trp lie Gly lie Phe Phe Cys 
15 10 



40 



<210> 
<211> 
<212> 
<213> 



31 
10 
PRT 

Artificial 



Sequence 



45 



50 



55 



60 



<400> 31 

Ser Gin Met Met Thr Phe Phe lie Phe Leu 
15 10 

<210> 32 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<400> 32 

Val Leu Leu Leu Pro Phe Val Val Ser Cys 
15 10 

<210> 33 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<400> 33 

lie Leu Cys Tyr Tyr Trp Leu Asn Thr Val 
15 10 



65 



<210> 34 
<211> 10 
<212> PRT 



9 
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<213> Artificial Sequence 
<400> 34 

Thr Leu lie Val Leu lie lie Thr Tyr Leu 
1 5 10 



10 



<210> 35 
<211> 10 
<212> PRT 

<213> Artificial Sequence 



<400> 35 
Tyr Val Leu Leu lie 
1 5 



Arg Asn lie Phe 



Leu 
10 



15 



20 



25 



30 



<210> 36 
<211> 10 
<212> PRT 

<213> Artificial S&quQnce: 
<400> 36 

Phe Val Phe Ser Leu Val Asn Ser Phe Leu 
15 10 

<210> 37 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<400> 37 

Leu Leu Leu Pro Phe Val Val Ser Cys lie 
15 10 



35 



40 



Leu 
1 



<210> 38 
<211> 10 
<212> PRT 

<213> Artificial Sequence 



<400> 38 
lie Gly Ser 



Val 
5 



His Phe Phe Phe 



lie 
10 



45 



50 



<210> 39 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<400> 39 

Tyr Glu Met Pro Arg His Glu Val Tyr Val 
1 5 10 



55 



<210> 40 
<211> 10 
<212> PRT 

<213> Artificial Sequence 



Leu 
1 



<400> 40 
Leu Thr Arg 



60 



Phe 
5 



Ser Ala Tyr Met 



Val 
10 



65 



<210> 41 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<400> 41 



10 
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lie Tyr Ser Gly Gly lie Thr Lys Leu 
1 5 



10 



<210> 42 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<400> 42 

Val Tyr Tyr Leu Ala Glu Tyr Asn Leu 
1 5 



15 



20 



<210> 43 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<400> 43 

Tyr Tyr Trp Leu Asn Thr Val Ala Leu 
1 5 



25 



<210> 44 
<211> 9 
<212> PRT 

<213> Artificial Sequence 



30 



<400> 44 
lie Tyr Arg Leu Leu 
1 5 



Leu Met Asp Phe 



35 



<210> 45 
<211> 10 
<212> PRT 

<213> Artificial Sequence 



<400> 45 
Val Tyr Val Leu Leu 
1 5 



lie Arg Asn lie 



Phe 
10 



40 



45 



<210> 46 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<400> 46 

Cys Tyr Tyr Trp Leu Asn Thr Val Ala Leu 
15 10 



50 



55 



60 



65 



<210> 47 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<400> 47 

Tyr Tyr Leu Ala Glu Tyr Asn Leu Glu Phe 
15 10 

<210> 48 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<400> 48 

Met Leu Phe lie Met Phe Tyr Ser Lys 
1 5 



11 
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<210> 49 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<400> 49 

Leu Leu lie Arg Asn lie Phe Leu Lys 
1 5 



10 



15 



<210> 50 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<400> 50 

Gly Leu Gin Glu Phe Asp lie Ala Arg 

1 5 



20 



<210> 
<211> 
<212> 
<213> 



51 
10 
PRT 

Artificial 



Sequence 



25 



<400> 51 

Lys Met Phe Leu lie Glu Lys Leu lie Lys 
1 5 10 



30 



35 



<210> 52 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<400> 52 

lie Met Leu Phe lie Met Phe Tyr Ser Lys 
1 5 10 



40 



<210> 53 
<211> 10 
<212> PRT 

<213> Artificial Sequence 



45 



50 



55 



60 



<400> 53 

Lys Leu lie Lys Leu Gin Asp Met Glu Lys 
15 10 

<210> 54 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<400> 54 

Val Leu Leu lie Arg Asn lie Phe Leu Lys 
15 10 

<210> 55 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<400> 55 

Met Pro Arg His Glu Val Tyr Val Leu 
1 5 



65 



<210> 56 
<211> 10 
<212> PRT 



12 
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10 



15 



20 



<213> Artificial Sequence 
<400> 56 

Met Pro Arg His Glu Val Tyr Val Leu Leu 
15 10 

<210> 57 

<211> 9 

<212> PRT 

<213> Artificial Sequence 

<400> 57 

Tyr Glu Met Pro Arg His Glu Val Tyr 
1 5 

<210> 58 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<400> 58 

Trp Glu Thr Leu lie Gly Gin Asp He Tyr 
15 10 



25 



30 



<210> 59 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<400> 59 

Val His Phe Phe Phe He Leu Thr Leu 
1 5 



35 



40 



<210> 60 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<400> 60 

Leu Arg Arg He lie Gly Met Gin Leu 
1 5 



45 



50 



<210> 61 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<400> 61 

Leu Leu Met Asp Phe Val Phe Ser Leu 
1 5 



55 



60 



65 



<210> 62 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<400> 62 

Met Gin Leu He Thr Ser Leu Gly Leu 

1 5 

<210> 63 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<400> 63 



13 
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Phe Val Phe Ser Leu Val Asn Ser Phe 
1 5 



10 



<210> 64 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<400> 64 

Phe lie Leu Thr Leu lie Val Leu lie 
1 5 



15 



<210> 65 
<211> 9 
<212> PRT 

<213> Artificial Sequence 



20 



<400> 65 
Trp Gin lie Thr 
1 



Glu Gly Arg Lys lie 
5 



25 



30 



35 



40 



45 



<210> 66 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<400> 66 

Met Phe Tyr Ser Lys Asn lie Ser Leu 
1 5 

<210> 67 

<211> 9 

<212> PRT 

<213> Artificial Sequence 

<400> 67 

Phe lie Met Phe Tyr Ser Lys Asn lie 
1 5 

<210> 68 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<400> 68 

Trp Arg Ala Ser Gin Met Met Thr Phe 
1 5 



50 



55 



<210> 69 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<400> 69 

Phe Phe Phe lie Leu Thr Leu lie Val 
1 5 



60 



65 



<210> 70 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<400> 70 

Phe Arg Leu Val Glu Arg Tyr Glu Met 
1 5 



14 



WO 01/23417 



P qg4EP0O /O95O0 



<210> 71 

<211> 9 

<212> PRT 

<213> Artificial Sequence 

<400> 71 

Phe Phe lie Leu Thr Leu lie Val Leu 
1 5 

<210> 72 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<400> 72 

Tyr Ala Gin Thr Leu Val Trp lie Ala 
1 5 



15 
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SEQUENCE LISTING 

<110> COCHE, Thierry 

CASSERT, Jean-Pol 

GAULIS, Swann Romain Jean-Thomas 
VINALS Y DE BASSOLS Carlotta 

<120> Human Tumor-Associated LAK-4P Related 

Polynucleotides and Polypeptides and Their Uses 



<130> BC45263 

<150> GB 9923154.0 
<151> 1999-09-30 

<150> GB 0016839.3 
<151> 2000-07-07 

<160> 72 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 2407 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 1 

tggggaggca gaaggcagac tgatcacttg aggccaggag tttgagacct catgtctaaa 
aaaaaaaaat tctgtgaggt gagttttatt gttattccct ctctacagat atggaaactg 120 
aggctgagaa tcagaaccat tcacaagaca aaaatccccc agttggcaga tccagggttg 
caagccaggc ctgtgcagcc ccaaaaccag tgcttgttta accactgtgt ggtgaccaca 
ccgctccagg ccaacagctt ggggctaagt cttcacgttg cctttcacca ttaaataata 
gggctgccct ttgttgaagc cctgcactcc cagtgacggc cataataacc ttcaggtgtt 
ctgctttctg ccttctctag catggccaag tatttccgga acaacttcat taatccccac 
atttactccg gagggatcac caagctgatc ttttgctggg acttcactgt cactcatgaa 
aaagctgtga agctaaaaca gaagaatctt agcactgaga taagggagaa cctgtcagag 
ctccgtcagg agaattccaa gttgacgttc aatcagctgc tgacccgctt ctctgcctac 
atggtagcct gggttgtctc tacaggagtg gccatagcct gctgtgcagc cgtttattac 
ctggctgagt acaacttaga gttcctgaag acacacagta accctggggc ggtgctgtta 
ctgcctttcg ttgtgtcctg cattaatctg gccgtgccat gcatctactc catgttcagg 
cttgtggaga ggtacgagat gccacggcac gaagtctacg ttctcctgat ccgaaacatc 
tttttgaaaa tatcaatcat tggcattctt tgttactatt ggctcaacac cgtggccctg 
tctggtgaag agtgttggga aaccctcatt ggccaggaca tctaccggct ccttctgatg 
gattttgtgt tctctttagt caattccttc ctgggggagt ttctgaggag aatcattggg 1020 
atgcaactga tcacaagtct tggccttcag gagtttgaca ttgccaggaa cgttctagaa 1080 
ctgatctatg cacaaactct ggtgtggatt ggcatcttct tctgccccct gctgcccttt 1140 
atccaaatga ttatgctttt catcatgttc tactccaaaa atatcagcct gatgatgaat 1200 
ttccagcctc cgagcaaagc ctggcgggcc tcacagatga tgactttctt catcttcttg 1260 



60 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 




ctctttttcc catccttcac cggggtcttg tgcaccctgg ccatcaccat ctggagattg 1320 

aagccttcag ctgactgtgg cccttttcga ggtctgcctc tcttcattca ctccatctac 1380 

agctggatcg acaccctaag tacacggcct ggctacctgt gggttgtttg gatctatcgg 1440 

aacctcattg gaagtgtgca cttctttttc atcctcaccc tcattgtgct aatcatcacc 1500 

tatctttact ggcagatcac agagggaagg aagattatga taaggctgct ccatgagcag 1560 

atcattaatg agggcaaaga taaaatgttc ctgatagaaa aattgatcaa gctgcaggat 1620 

atggagaaga aagcaaaccc cagctcactt gttctggaaa ggagagaggt ggagcaacaa 1680 

ggctttttgc atttggggga acatgatggc agtcttgact tgcgatctag aagatcagtt 1740 

caagaaggta atccaagggc ctgatgactc ttttggtaac cagacaccaa tcaaataagg 1800 

ggaggagacg aaaatggaat gatttcttcc atgccacctg tgcctttagg aactgcccag 1860 

aagaaaatcc aaggctttag ccaggagcgg aaactgacta ccatgtaatt atcaaagtaa 1920 

aattgggcat tccatgctat ttttaatacc tggattgctg atttttcaag acaaaatact 1980 

tggggttttc caataaagat tgttgtaata ttgaaatgag cctacaaaaa cctaggaaga 2040 

gataactagg gaataatgta tattatcttc aagaaatgtg tgcaggaatg attggttctt 2100 

agaaatctct cctgccagac ttcccagacc tggcaaaggt ttagaaactg ttgctaagaa 2160 

aagtggtcca tcctgaataa acatgtaata ctccagcagg gatatgaagc ctctgaattg 2220 

tagaacctgc atttatttgt gactttgaac taaagacatc ccccatgtcc caaaggtgga 2280 

atacaaccag aggtctcatc tctgaacttt cttgcgtact gattacatga gtctttggag 2340 

tcggggatgg aggaggttct gcccctgtga ggtgttatac atgaccatca aagtcctacg 2400 

tcaagct 2407 



<210> 2 
<211> 460 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> primer 



<400> 2 

Met Ala Lys Tyr 
1 

Gly Gly lie Thr 

20 

Glu Lys Ala Val 

35 

Glu Asn Leu Ser 
50 

Gin Leu Leu Thr 

65 

Thr Gly Val Ala 

Tyr Asn Leu Glu 

100 

Leu Leu Pro Phe 
115 

Tyr Ser Met Phe 
130 

Val Tyr Val Leu 
145 

Gly lie Leu Cys 

Glu Cys Trp Glu 

18 0 



Phe Arg Asn Asn 

5 

Lys Leu lie Phe 

Lys Leu Lys Gin 

40 

Glu Leu Arg Gin 

55 

Arg Phe Ser Ala 
70 

lie Ala Cys Cys 
85 

Phe Leu Lys Thr 

Val Val Ser Cys 

120 

Arg Leu Val Glu 
135 

Leu lie Arg Asn 
150 

Tyr Tyr Trp Leu 
165 

Thr Leu lie Gly 



Phe lie Asn Pro 
10 

Cys Trp Asp Phe 
25 

Lys Asn Leu Ser 

Glu Asn Ser Lys 

60 

Tyr Met Val Ala 
75 

Ala Ala Val Tyr 
90 

His Ser Asn Pro 
105 

lie Asn Leu Ala 

Arg Tyr Glu Met 

140 

lie Phe Leu Lys 
155 

Asn Thr Val Ala 
170 

Gin Asp lie Tyr 
185 



His He Tyr Ser 
15 

Thr Val Thr His 

30 

Thr Glu He Arg 
45 

Leu Thr Phe Asn 

Trp Val Val Ser 

80 

Tyr Leu Ala Glu 
95 

Gly Ala Val Leu 
110 

Val Pro Cys He 
125 

Pro Arg His Glu 

He Ser He He 

160 

Leu Ser Gly Glu 
175 

Arg Leu Leu Leu 
190 
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? 



Met 


Asp 


Phe 


Val 


Phe 


Ser 


Leu 


Val 


Asn 


Ser 


Phe 


Leu 


Gly 


Glu 


Phe 


Leu 




195 










200 










205 








Arg 


Arg 


He 


He 


Gly 


Met 


Gin 


Leu 


He 


Thr 


Ser 


Leu 


Gly 


Leu 


Gin 


Glu 


210 










215 










220 










Phe 


Asp 


He 


Ala 


Arg 


Asn 


Val 


Leu 


Glu 


Leu 


He 


Tyr 


Ala 


Gin 


Thr 


Leu 


225 








230 










235 










240 


Val 


Trp 


He 


Gly 


He 


Phe 


Phe 


Cys 


Pro 


Leu 


Leu 


Pro 


Phe 


He 


Gin 


Met 






245 










250 










255 




He 


Met 


Leu 


Phe 


He 


Met 


Phe 


Tyr 


Ser 


Lys 


Asn 


He 


Ser 


Leu 


Met 


Met 








260 










265 










270 






Asn 


Phe 


Gin 


Pro 


Pro 


Ser 


Lys 


Ala 


Trp 


Arg 


Ala 


Ser 


Gin 


Met 


Met 


Thr 






275 










280 










285 








Phe 


Phe 


He 


Phe 


Leu 


Leu 


Phe 


Phe 


Pro 


Ser 


Phe 


Thr 


Gly 


Val 


Leu 


Cys 




290 










295 










300 










Thr 


Leu 


Ala 


He 


Thr 


He 


Trp 


Arg 


Leu 


Lys 


Pro 


Ser 


Ala 


Asp 


Cys 


Gly 


305 










310 










315 










320 


Pro 


Phe 


Arg 


Gly 


Leu 


Pro 


Leu 


Phe 


He 


His 


Ser 


He 


Tyr 


Ser 


Trp 


He 






325 










330 










335 




Asp 


Thr 


Leu 


Ser 


Thr 


Arg 


Pro 


Gly 


Tyr 


Leu 


Trp 


Val 


Val 


Trp 


He 


Tyr 






340 










345 










350 






Arg 


Asn 


Leu 


He 


Gly 


Ser 


Val 


His 


Phe 


Phe 


Phe 


He 


Leu 


Thr 


Leu 


He 




355 










360 










365 








Val 


Leu 


He 


He 


Thr 


Tyr 


Leu 


Tyr 


Trp 


Gin 


He 


Thr 


Glu 


Gly Arg 


Lys 




370 










375 










380 










He 


Met 


He 


Arg 


Leu 


Leu 


His 


Glu 


Gin 


He 


He 


Asn 


Glu 


Gly 


Lys 


Asp 


385 








390 










395 










400 


Lys 


Met 


Phe 


Leu 


He 


Glu 


Lys 


Leu 


He 


Lys 


Leu 


Gin 


Asp 


Met 


Glu 


Lys 








405 










410 










415 




Lys 


Ala 


Asn 


Pro 


Ser 


Ser 


Leu 


Val 


Leu 


Glu 


Arg 


Arg 


Glu 


Val 


Glu 


Gin 






420 










425 










430 






Gin 


Gly 


Phe 


Leu 


His 


Leu 


Gly 


Glu 


His 


Asp 


Gly 


Ser 


Leu 


Asp 


Leu 


Arg 




435 










440 










445 








Ser 


Arg 


Arg 


Ser 


Val 


Gin 


Glu 


Gly 


Asn 


Pro 


Arg 


Ala 












450 








455 










460 











<210> 3 
<211> 2521 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 



<400> 3 

tggggaggca gaaggcagac tgatcacttg aggccaggag tttgagacct catgtctaaa 
aaaaaaaaat tctgtgaggt gagttttatt gttattccct ctctacagat atggaaactg 120 
aggctgagaa tcagaaccat tcacaagaca aaaatccccc agttggcaga tccagggttg 
caagccaggc ctgtgcagcc ccaaaaccag tgcttgttta accactgtgt ggtgaccaca 
ccgctccagg ccaacagctt ggggctaagt cttcacgttg cctttcacca ttaaataata 
gggctgccct ttgttgaagc cctgcactcc cagtgacggc cataataacc ttcaggtgtt 
ctgctttctg ccttctctag catggccaag tatttccgga acaacttcat taatccccac 
atttactccg gagggatcac caagctgatc ttttgctggg acttcactgt cactcatgaa 
aaagctgtga agctaaaaca gaagaatctt agcactgaga taagggagaa cctgtcagag 
ctccgtcagg agaattccaa gttgacgttc aatcagctgc tgacccgctt ctctgcctac 



60 



180 
240 
300 
360 
420 
480 
540 
600 
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<400> 4 






























Met 


Ala 


Lys 


Tyr 


Phe 


Arg 


Asn 


Asn 


Phe 


He 


Asn 


Pro 


His 


He 


Tyr 


Ser 


1 






5 










10 










15 




Gly 


Gly 


He 


Thr 


Lys 


Leu 


He 


Phe 


Cys 


Trp 


Asp 


Phe 


Thr 


Val 


Thr 


His 




20 










25 










30 






Glu 


Lys 


Ala 


Val 


Lys 


Leu 


Lys 


Gin 


Lys 


Asn 


Leu 


Ser 


Thr 


Glu 


He 


Arg 




35 










40 










45 








Glu 


Asn 


Leu 


Ser 


Glu 


Leu 


Arg 


Gin 


Glu 


Asn 


Ser 


Lys 


Leu 


Thr 


Phe 


Asn 




50 










55 










60 










Gin 


Leu 


Leu 


Thr 


Arg 


Phe 


Ser 


Ala 


Tyr 


Met 


Val 


Ala 


Trp 


Val 


Val 


Ser 


65 










70 










75 










80 


Thr 


Gly 


Val 


Ala 


He 


Ala 


Cys 


Cys 


Ala 


Ala 


Val 


Tyr 


Tyr 


Leu 


Ala 


Glu 



atggtagcct gggttgtctc tacaggagtg gccatagcct gctgtgcagc cgtttattac 660 

ctggctgagt acaacttaga gttcctgaag acacacagta accctggggc ggtgctgtta 720 

ctgcctttcg ttgtgtcctg cattctggcc gtgccatgca tctactccat gttcaggctt 780 

gtggagaggt acgagatgcc acggcacgaa gtctacgttc tcctgatccg caggggattg 840 

atgtagttct caagtatggg atgtacagat gggcaggcag tgcacgcaca aaggctcctg 900 

ggctgaggac gggactgaaa tcatccagcg ttccccttag tcaagctaaa catctttttg 960 

aaaatatcaa tcattggcat tctttgttac tattggctca acaccgtggc cctgtctggt 1020 

gaagagtgtt gggaaaccct cattggccag gacatctacc ggctccttct gatggatttt 1080 

gtgttctctt tagtcaattc cttcctgggg gagtttctga ggagaatcat tgggatgcaa 1140 

ctgatcacaa gtcttggcct tcaggagttt gacattgcca ggaacgttct agaactgatc 1200 

tatgcacaaa ctctggtgtg gattggcatc ttcttctgcc ccctgctgcc ctttatccaa 1260 

atgattatgc ttttcatcat gttctactcc aaaaatatca gcctgatgat gaatttccag 1320 

cctccgagca aagcctggcg ggcctcacag atgatgactt tcttcatctt cttgctcttt 1380 

ttcccatcct tcaccggggt cttgtgcacc ctggccatca ccatctggag attgaagcct 1440 

tcagctgact gtggcccttt tcgaggtctg cctctcttca ttcactccat ctacagctgg 1500 

atcgacaccc taagtacacg gcctggctac ctgtgggttg tttggatcta tcggaacctc 1560 

attggaagtg tgcacttctt tttcatcctc accctcattg tgctaatcat cacctatctt 1620 

tactggcaga tcacagaggg aaggaagatt atgataaggc tgctccatga gcagatcatt 168 0 

aatgagggca aagataaaat gttcctgata gaaaaattga tcaagctgca ggatatggag 1740 

aagaaagcaa accccagctc acttgttctg gaaaggagag aggtggagca acaaggcttt 1800 

ttgcatttgg gggaacatga tggcagtctt gacttgcgat ctagaagatc agttcaagaa 1860 

ggtaatccaa gggcctgatg actcttttgg taaccagaca ccaatcaaat aaggggagga 1920 

gacgaaaatg gaatgatttc ttccatgcca cctgtgcctt taggaactgc ccagaagaaa 

atccaaggct ttagccagga gcggaaactg actaccatgt aattatcaaa gtaaaattgg 

gcattccatg ctatttttaa tacctggatt gctgattttt caagacaaaa tacttggggt 

tttccaataa agattgttgt aatattgaaa tgagcctaca aaaacctagg aagagataac 2160 

tagggaataa tgtatattat cttcaagaaa tgtgtgcagg aatgattggt tcttagaaat 2220 

ctctcctgcc agacttccca gacctggcaa aggtttagaa actgttgcta agaaaagtgg 2280 

tccatcctga ataaacatgt aatactccag cagggatatg aagcctctga attgtagaac 2340 

ctgcatttat ttgtgacttt gaactaaaga catcccccat gtcccaaagg tggaatacaa 2400 

ccagaggtct catctctgaa ctttcttgcg tactgattac atgagtcttt ggagtcgggg 2460 

atggaggagg ttctgcccct gtgaggtgtt atacatgacc atcaaagtcc tacgtcaagc 2520 

t - 2521 

<210> 4 
<211> 154 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 



1980 
2040 
2100 
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85 90 95 

Tyr Asn Leu Glu Phe Leu Lys Thr His Ser Asn Pro Gly Ala Val Leu 

100 105 110 

Leu Leu Pro Phe Val Val Ser Cys lie Leu Ala Val Pro Cys He Tyr 

115 120 125 

Ser Met Phe Arg Leu Val Glu Arg Tyr Glu Met Pro Arg His Glu Val 

130 135 140 

Tyr Val Leu Leu He Arg Arg Gly Leu Met 
145 150 

<210> 5 
<211> 1960 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 5 

atcttttgct gggacttcac tgtcactcat gaaaaagctg tgaagctaaa acagaagaat 60 
cttagcactg agataaggga gaacctgtca gagctccgtc aggagaattc caagttgacg 120 
ttcaatcagc tgctgacccg cttctctgcc tacatggtag cctgggttgt ctctacagga 180 
gtggccatag cctgctgtgc agccgtttat tacctggctg agtacaactt agagttcctg 240 
aagacacaca gtaaccctgg ggcggtgctg ttactgcctt tcgttgtgtc ctgcattaat 
ctggccgtgc catgcatcta ctccatgttc aggcttgtgg agaggtacga gatgccacgg 
cacgaagtct acgttctcct gatccgaaac atctttttga aaatatcaat cattggcatt 
ctttgttact attggctcaa caccgtggcc ctgtctggtg aagagtgttg ggaaaccctc 
attggccagg acatctaccg gctccttctg atggattttg tgttctcttt agtcaattcc 
ttcctggggg agtttctgag gagaatcatt gggatgcaac tgatcacaag tcttggcctt 
caggagtttg acattgccag gaacgttcta gaactgatct atgcacaaac tctggtgtgg 
attggcatct tcttctgccc cctgctgccc tttatccaaa tgattatgct tttcatcatg 
ttctactcca aaaatatcag cctgatgatg aatttccagc ctccgagcaa agcctggcgg 
gcctcacaga tgatgacttt cttcatcttc ttgctctttt tcccatcttt caccggggtc 
ttgtgcaccc tggccatcac catctggaga ttgaagcctt cagctgactg tggccctttt 
cgaggtctgc ctctcttcat tcactccatc tacagctgga tcgacaccct aagtacacgg 
cctggctacc tgtgggttgt ttggatctat cggaacctca ttggaagtgt gcacttcttt 1020 
ttcatcctca ccctcattgt gctgatcatc acctatcttt actggcagat cacagaggga 1080 
aggaagatta tgataaggct gctccatgag cagatcatta atgagggcaa agataaaatg 1140 
ttcctgatag aaaaattgat caagctgcag gatatggaga agaaagcaaa ccccagctca 1200 
cttgttctgg aaaggagaga ggtggagcaa caaggctttt tgcatttggg ggaacatgat 1260 
ggcagtcttg acttgcgatc tagaagatca gttcaagaag gtaatccaag ggcctgatga 1320 
ctcttttggt aaccagacac caatcaaata aggggaggag atgaaaatgg aatgatttct 1380 
tccatgccac ctgtgccttt aggaactgcc cagaagaaaa tccaaggctt tagccaggag 1440 
cggaaactga ctaccatgta attatcaaag taaaattggg cattccatgc tatttttaat 1500 
acctggattg ctgatttttc aagacaaaat acttggggtt ttccaataaa gattgttgta 1560 
atattgaaat gagcctacaa aaacctagga agagataact agggaataat gtatattatc 
ttcaagaaat gtgtgcagga atgattggtt cttagaaatc tctcctgcca gacttcccag " 



300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 



1680 



acctggcaaa ggtttagaaa ctgttgctaa gaaaagtggt ccatcctgaa taaacatgta 1740 

atactccagc agggatatga agcctctgaa ttgtagaacc tgcatttatt tgtgactttg 1800 

aactaaagac atcccccatg tcccaaaggt ggaatacaac cagaggtctc atctctgaac 1860 

tttcttgcgt actgattaca tgagtctttg gagtcgggga tggaggaggt tctgcccctg 1920 

tgaggtgtta tacatgacca tcaaagtcct acgtcaagct 1960 



<210> 6 



<211> 438 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> primer 
<400> 6 

lie Phe Cys Trp 
1 

Lys Gin Lys Asn 

20 

Arg Gin Glu Asn 
35 

Ser Ala Tyr Met 
50 

Cys Cys Ala Ala 
65 

Lys Thr His Ser 

Ser Cys lie Asn 

100 

Val Glu Arg Tyr 
115 

Arg Asn lie Phe 
130 

Trp Leu Asn Thr 
145 

lie Gly Gin Asp 

Leu Val Asn Ser 

180 

Gin Leu lie Thr 
195 

Val Leu Glu Leu 
210 

Phe Cys Pro Leu 

225 

Phe Tyr Ser Lys 

Lys Ala Trp Arg 

260 

Phe Phe Pro Ser 
275 

Trp Arg Leu Lys 
290 

Leu Phe lie His 

305 

Pro Gly Tyr Leu 

Val His Phe Phe 

340 

Leu Tyr Trp Gin 

355 



Asp Phe Thr Val 
5 

Leu Ser Thr Glu 

Ser Lys Leu Thr 

40 

Val Ala Trp Val 

55 

Val Tyr Tyr Leu 
70 

Asn Pro Gly Ala 
85 

Leu Ala Val Pro 

Glu Met Pro Arg 

120 

Leu Lys lie Ser 
135 

Val Ala Leu Ser 
150 

lie Tyr Arg Leu 

165 

Phe Leu Gly Glu 

Ser Leu Gly Leu 

200 

lie Tyr Ala Gin 
215 

Leu Pro Phe lie 

230 

Asn lie Ser Leu 
245 

Ala Ser Gin Met 

Phe Thr Gly Val 

280 

Pro Ser Ala Asp 

295 

Ser lie Tyr Ser 
310 

Trp Val Val Trp 
325 

Phe lie Leu Thr 

lie Thr Glu Gly 

360 



Thr His Glu Lys 
10 

lie Arg Glu Asn 

25 

Phe Asn Gin Leu 

Val Ser Thr Gly 

60 

Ala Glu Tyr Asn 
75 

Val Leu Leu Leu 
90 

Cys lie Tyr Ser 
105 

His Glu Val Tyr 

He He Gly He 

140 

Gly Glu Glu Cys 
155 

Leu Leu Met Asp 
170 

Phe Leu Arg Arg 
185 

Gin Glu Phe Asp 

Thr Leu Val Trp 

220 

Gin Met He Met 

235 

Met Met Asn Phe 
250 

Met Thr Phe Phe 

265 

Leu Cys Thr Leu 

Cys Gly Pro Phe 

300 

Trp He Asp Thr 
315 

He Tyr Arg Asn 
330 

Leu He Val Leu 
345 

Arg Lys He Met 



Ala Val Lys Leu 
15 

Leu Ser Glu Leu 
30 

Leu Thr Arg Phe 
45 

Val Ala He Ala 

Leu Glu Phe Leu 

80 

Pro Phe Val Val 
95 

Met Phe Arg Leu 
110 

Val Leu Leu He 
125 

Leu Cys Tyr Tyr 

Trp Glu Thr Leu 

160 

Phe Val Phe Ser 
175 

He He Gly Met 
190 

He Ala Arg Asn 

205 

He Gly He Phe 

Leu Phe He Met 

240 

Gin Pro Pro Ser 
255 

He Phe Leu Leu 
270 

Ala He Thr He 
285 

Arg Gly Leu Pro 

Leu Ser Thr Arg 

320 

Leu He Gly Ser 
335 

He He Thr Tyr 
350 

He Arg Leu Leu 

365 
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His Glu Gin He He Asn Glu Gly Lys Asp Lys Met Phe Leu He Glu 

370 375 380 

Lys Leu He Lys Leu Gin Asp Met Glu Lys Lys Ala Asn Pro Ser Ser 
385 390 395 400 

Leu Val Leu Glu Arg Arg Glu Val Glu Gin Gin Gly Phe Leu His Leu 

405 410 415 

Gly Glu His Asp Gly Ser Leu Asp Leu Arg Ser Arg Arg Ser Val Gin 

420 425 430 

Glu Gly Asn Pro Arg Ala 
435 

<210> 7 
<211> 1219 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> primer 
<400> 7 

ctgatgatga atttccagcc tccgagcaaa gcctggcggg cctcacagat gatgactttc 60 
ttcatcttct tgctcttttt cccatctttc accggggtct tgtgcaccct ggccatcacc 120 



180 



atctggagat tgaagccttc agctgactgt ggcccttttc gaggtctgcc tctcttcatt 
cactccatct acagctggat cgacacccta agtacacggc ctggctacct gtgggttgtt 240 
tggatctatc ggaacctcat tggaagtgtg cacttctttt tcatcctcac cctcattgtg 300 
ctgatcatca cctatcttta ctggcagatc acagagggaa ggaagattat gataaggctg 360 
ctccatgagc agatcattaa tgagggcaaa gataaaatgt tcctgataga aaaattgatc 420 
aagctgcagg atatggagaa gaaagcaaac cccagctcac ttgttctgga aaggagagag 480 
gtggagcaac aaggcttttt gcatttgggg gaacatgatg gcagtcttga cttgcgatct 540 
agaagatcag ttcaagaagg taatccaagg gcctgatgac tcttttggta accagacacc 600 
aatcaaataa ggggaggaga tgaaaatgga atgatttctt ccatgccacc tgtgccttta 660 
aqaactqccc agaagaaaat ccaaggcttt agccaggagc ggaaactgac taccatgtaa 720 

780 
840 
900 
960 



ttatcaaagt aaaattgggc attccatgct atttttaata cctggattgc tgatttttca 
agacaaaata cttggggttt tccaataaag attgttgtaa tattgaaatg agcctacaaa 
aacctaggaa gagataacta gggaataatg tatattatct tcaagaaatg tgtgcaggaa 
tgattggttc ttagaaatct ctcctgccag acttcccaga cctggcaaag gtttagaaac 
tgttgctaag aaaagtggtc catcctgaat aaacatgtaa tactccagca gggatatgaa 1020 
gcctctgaat tgtagaacct gcatttattt gtgactttga actaaagaca tcccccatgt 1080 
cccaaaggtg gaatacaacc agaggtctca tctctgaact ttcttgcgta ctgattacat 1140 
gagtctttgg agtcggggat ggaggaggtt ctgcccctgt gaggtgttat acatgaccat 1200 
caaagtccta cgtcaagct 1219 

<210> 8 
<211> 191 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 8 

Leu Met Met Asn Phe Gin Pro Pro Ser Lys Ala Trp Arg Ala Ser Gin 

15 10 15 

Met Met Thr Phe Phe He Phe Leu Leu Phe Phe Pro Ser Phe Thr Gly 
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20 25 30 



Val 


Leu 


Cys 
35 


Thr 


Leu 


Ala 


He 


Thr 
40 


He 


Trp 


Arg 


Leu 


Lys 
45 


Pro 


Ser 


Ala 


Asp 


Cys 
50 


Gly 


Pro 


Phe 


Arg 


Gly 
55 


Leu 


Pro 


Leu 


Phe 


He 

60 


His 


Ser 


He 


Tyr 


Ser 


Trp 


lie 


Asp 


Thr 


Leu 


Ser 


Thr 


Arg 


Pro 


Gly 


Tyr 


Leu 


Trp 


Val 


Val 


65 








70 










75 










80 


Trp 

IT 


lie 


Tyr 


Arg 


Asn 


Leu 


He 


Gly 


Ser 


Val 


His 


Phe 


Phe 


Phe 


He 


Leu 






85 










90 










95 




Thr 


Leu 


He 


Val 
100 


Leu 


He 


He 


Thr 


Tyr 
105 


Leu 


Tyr 


Trp 


Gin 


He 
110 


Thr 


Glu 


Gly 


Arg 


Lys 
115 


He 


Met 


He 


Arg 


Leu 
120 


Leu 


His 


Glu 


Gin 


He 
125 


He 


Asn 


Glu 


Gly 


Lys 
130 


Asp 


Lys 


Met 


Phe 


Leu 
135 


He 


Glu 


Lys 


Leu 


He 
140 


Lys 


Leu 


Gin 


Asp 


Met 


Glu 


Lys 


Lys 


Ala 


Asn 


Pro 


Ser 


Ser 


Leu 


Val 


Leu 


Glu 


Arg 


Arg 


Glu 


145 






150 










155 










160 


Val 


Glu 


Gin 


Gin 


Gly 
165 


Phe 


Leu 


His 


Leu 


Gly 
170 


Glu 


His 


Asp 


Gly 


Ser 
175 


Leu 


Asp 


Leu 


Arg 


Ser 


Arg 


Arg 


Ser 


Val 


Gin 


Glu 


Gly 


Asn 


Pro 


Arg 


Ala 








180 










185 










190 







<210> 9 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 9 

He Thr Glu Gly Arg Lys He Met He Arg 
15 10 

<210> 10 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 10 

Leu Leu Met Asp Phe Val Phe Ser Leu 

1 5 

<210> li 

<211> 9 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> primer 
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<400> 11 

Phe Leu Leu Phe Phe Pro Ser Phe Thr 
1 5 

<210> 12 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 12 

Gin Met Met Thr Phe Phe lie Phe Leu 
1 5 

<210> 13 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 13 

Met Met Thr Phe Phe lie Phe Leu Leu 
1 5 

<210> 14 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 14 

Phe Leu lie Glu Lys Leu lie Lys Leu 
1 5 

<210> 15 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 15 

Val Leu Leu lie Arg Asn lie Phe Leu 
1 5 



<210> 16 
<211> 9 



4...H y„j- 'MM ?! J -,j„.j. .„„-j3 ; J: 4Ji Sj.J} iL.-J 



<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 16 

Leu Val Trp lie Gly lie Phe Phe Cys 
1 5 

<210> 17 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 17 

Thr Leu Ala lie Thr lie Trp Arg Leu 

1 5 

<210> 18 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 18 

Leu lie Phe Cys Trp Asp Phe Thr Val 
1 5 

<210> 19 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 19 

Phe Leu Gly Glu Phe Leu Arg Arg lie 
1 5 

<210> 20 
<211> 9 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> primer 
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<400> 20 

Leu Leu Leu Pro Phe Val Val Ser Cys 
1 5 

<210> 21 
<211> 9 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> primer 
<400> 21 

Leu Leu Thr Arg Phe Ser Ala Tyr Met 
1 5 

<210> 22 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<22 0> 

<223> primer 
<400> 22 

Lys Leu lie Phe Cys Trp Asp Phe Thr 
1 5 

<210> 23 
<211> 10 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> primer 
<400> 23 

Leu Leu Leu Met Asp Phe Val Phe Ser Leu 
15 10 

<210> 24 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 24 

Leu Leu Met Asp Phe Val Phe Ser Leu Val 
15 10 

<210> 25 
<211> 10 



<212> PRT 

<213> Artificial Sequence 



<220> 

<223> primer 
<400> 25 

Gin Met Met Thr Phe Phe lie Phe Leu Leu 
15 10 

<210> 26 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 26 

Tyr Leu Ala Glu Tyr Asn Leu Glu Phe Leu 
15 10 

<210> 27 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 27 

Lys Leu lie Phe Cys Trp Asp Phe Thr Val 
15 10 

<210> 28 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 28 

Leu Leu Phe Phe Pro Ser Phe Thr Gly Val 
15 10 

<210> 29 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 



<400> 29 

Gin Leu Leu Thr Arg Phe Ser Ala Tyr Met 
1 5 10 



<210> 30 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 30 

Thr Leu Val Trp lie Gly lie Phe Phe Cys 
15 10 

<210> 31 
<211> 10 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> primer 
<400> 31 

Ser Gin Met Met Thr Phe Phe lie Phe Leu 
15 10 

<210> 32 
<211> 10 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> primer 
<400> 32 

Val Leu Leu Leu Pro Phe Val Val Ser Cys 
15 10 

<210> 33 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 33 

lie Leu Cys Tyr Tyr Trp Leu Asn Thr Val 
15 10 

<210> 34 
<211> 10 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 34 

Thr Leu lie Val Leu lie lie Thr Tyr Leu 
15 10 

<210> 35 
<211> 10 
<212> PRT 

<213> Artificial Sequence 

<220> , 
<223> primer 

<400> 35 

Tyr Val Leu Leu He Arg Asn He Phe Leu 
15 10 

<210> 36 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 36 

Phe Val Phe Ser Leu Val Asn Ser Phe Leu 
1 5 10 

<210> 37 
<211> 10 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> primer 
<400> 37 

Leu Leu Leu Pro Phe Val Val Ser Cys He 
15 10 

<210> 38 
<211> 10 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> primer 
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<400> 38 

Leu He Gly Ser Val His Phe Phe Phe lie 
15 10 

<210> 39 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 39 

Tyr Glu Met Pro Arg His Glu Val Tyr Val 
15 10 

<210> 40 
<211> 10 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> primer 
<400> 40 

Leu Leu Thr Arg Phe Ser Ala Tyr Met Val 
15 10 

<210> 41 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 41 

lie Tyr Ser Gly Gly lie Thr Lys Leu 
1 5 

<210> 42 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 42 

Val Tyr Tyr Leu Ala Glu Tyr Asn Leu 
1 1 J 5 



<210> 43 
<211> 9 
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<212> PRT 

<213> Artificial Sequence 

<220> 

<223> primer 
<400> 43 

Tyr Tyr Trp Leu Asn Thr Val Ala Leu 
1 5 

<210> 44 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 44 

lie Tyr Arg Leu Leu Leu Met Asp Phe 
1 5 

<210> 45 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 45 

Val Tyr Val Leu Leu lie Arg Asn lie Phe 
15 10 

<210> 46 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 46 

Cys Tyr Tyr Trp Leu Asn Thr Val Ala Leu 
15 10 

<210> 47 
<211> 10 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> primer 
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<400> 47 

Tyr Tyr Leu Ala Glu Tyr Asn Leu Glu Phe 
15 10 

<210> 48 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 48 

Met Leu Phe lie Met Phe Tyr Ser Lys 
1 5 

<210> 49 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 49 

Leu Leu lie Arg Asn lie Phe Leu Lys 
1 5 

<210> 50 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 50 

Gly Leu Gin Glu Phe Asp lie Ala Arg 
1 5 

<210> 51 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 51 

Lys Met Phe Leu lie Glu Lys Leu lie Lys 
15 10 



<210> 52 
<211> 10 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 52 

lie Met Leu Phe lie Met Phe Tyr Ser Lys 
15 10 

<210> 53 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 53 

Lys Leu lie Lys Leu Gin Asp Met Glu Lys 
15 10 

<210> 54 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 54 

Val Leu Leu lie Arg Asn lie Phe Leu Lys 
15 10 

<210> 55 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 55 

Met Pro Arg His Glu Val Tyr Val Leu 
1 5 

<210> 56 
<211> 10 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> primer 
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<400> 56 

Met Pro Arg His Glu Val Tyr Val Leu Leu 
15 10 

<210> 57 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 57 

Tyr Glu Met Pro Arg His Glu Val Tyr 
1 5 

<210> 58 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 58 

Trp Glu Thr Leu lie Gly Gin Asp lie Tyr 
15 10 

<210> 59 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 59 

Val His Phe Phe Phe lie Leu Thr Leu 
1 5 

<210> 60 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 60 

Leu Arg Arg lie lie Gly Met Gin Leu 
1 5 

<210> 61 
<211> 9 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 61 

Leu Leu Met Asp Phe Val Phe Ser Leu 

1 5 

<210> 62 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 62 

Met Gin Leu lie Thr Ser Leu Gly Leu 
1 5 

<210> 63 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 63 

Phe Val Phe Ser Leu Val Asn Ser Phe 
1 5 

<210> 64 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 64 

Phe lie Leu Thr Leu lie Val Leu lie 
1 5 

<210> 65 
<211> 9 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> primer 



<400> 65 

Trp Gin lie Thr Glu Gly Arg Lys lie 
1 5 



<210> 66 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 66 

Met Phe Tyr Ser Lys Asn lie Ser Leu 
1 5 

<210> 67 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 67 

Phe lie Met Phe Tyr Ser Lys Asn lie 
1 5 

<210> 68 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 68 

Trp Arg Ala Ser Gin Met Met Thr Phe 

1 5 

<210> 69 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 69 

Phe Phe Phe lie Leu Thr Leu lie Val 
1 5 

<210> 70 
<211> 9 
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<212> PRT 

<213> Artificial Sequence 

<220> 

<223> primer 
<400> 70 

Phe Arg Leu Val Glu Arg Tyr Glu Met 
1 5 

<210> 71 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 71 

Phe Phe lie Leu Thr Leu lie Val Leu 
1 5 

<210> 72 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> primer 



<400> 72 

Tyr Ala Gin Thr Leu Val Trp lie Ala 
1 5 
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